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A THEORY OF EANDOMNEf^S 
By M. G. KENDALL 
Introd rroTiON 

1 . In two recient papers Babington Smith & I (1938 and 1939) have discussed 
tlie problems of sainjiling with random numbers and the construction of tables 
of such numbers by mechanical methods. With the publication of 100,000 
numbcns (1940) what one may call the practical side of the investigation has 
come to an end. The purpose of this paper is to develop the theory of the subject 
ami 1 o jmt in their proper setting some of the ideas on which the practical re- 
.senreli was based. It is divided into two parts. In the first I deal with the 
.symbols and mathematics of the theory of random suites, my fundamental 
contention being that a theory of randomness can be developed within the 
fraiinework of existing mathematical notions. The second part indicates how the 
tluiory is to be related to jiractice. 

2. Much of the following work was suggested by the treatment of von Mises 
(lf)3()) and Ddrgc (1934), and I take the opportunity of expressing my indebted- 
nes.s to them. The principal difference between von Mises’s views and my own 
concerns ids concept of the Iixegular Kollektiv, or infinite random series. 
Numerous attempts have been made to show that this eonoept leads to a contra- 
diction and that it is therefore an improper foundation for a theory of proba- 
bility. Such attempts have mostly failed, but under pressure of the oriticisms 
embodied in them the definition of the Irregular Kollektiv ha.s been successively 
modified by von Mises’s followers until it has lost the pristine simplicity which 
was originally one of its most attractive features. I do not propose to discuss 
here the difficidtie.s associated with the concept of the Irregular Kollektiv or the 
various expedients which have been propo.sed to meet them. I have tried to cut 
the Gordian knot by rejecting the concept, and the theory below accordingly 
avoids all the difficulties attendant iij)on it. 

r.\RT 1. The theory of raudom suites 

3. I consider a finite number r of symbols A^, A.^., ..., each of which will 
be called a characteristic, and an infinite ordered series of these characteristics, 
which will be called a suite. For instance, if there were two characteristics 
and such a suite might be 

A2 ..., {!■) 

\^'here the characteristics appear alternately. Suites exist in the sense that they 

Biometrika xxxii ^ 



2 A theory of randomness 

can be completely specified by a law of formation, as the foregoing example 
shows. 

4. Definition, If the proportional frequency of each characteristic in. a suite 
tends to a limit in the mathematical sense, the suite is called “proper”. In the 
contrary case, “improper”. 

Proper suites exist; e.g. the proportional frequencies of jd^’s and .d^’s in (1) 
tend each to the limit 

Improper suites also exist. For example, it may be shown that if we take the 
ith digit in the logarithms to base 10 of all the integers, beginning with 1, the 
suite of the digits 0-9 so obtained is improper, since no proportional frequency 
tends to a limit. 

Suites also exist which are proper for one characteristic and improper for 
another, provided that there are more than two characteristics. For we may 
build a suite from the logarithm table in the manner just described, and then 
insert a new characteristic Q between successive digits. The proportional 
frequency of Q will then tend to but those of the others will not tend to a limit. 
If, however, there are two characteristics, the proportional frequencies, being 
together equal to unity, must tend to a limit together. 

6. Definition, The limit of the proportional frequency of a characteristic in 
a proper suite is called the probability of that characteristic in that suite, 

6. Definition. By a “Selector” I mean an infinite series of positive integers 
ordered according to their magnitude. A selector, being infinite, must be 
specified by a law of formation, not by ennmeration. 

There is a special class of such laws which deserves separate consideration. 
Suppose we have such a suite as this : 

( 2 ) 

characteristics after the first appearing alternately in pairs. 

Our law of formation of the selector might be in these terms ; proceed along 
the series until you come to the combination then choose the ordinal 

number of the next following member of the series, and proceed until you again 
meet that combination; and so on. The series of ordinals so obtained is the 
selector. 

The importance of selectors of this type is that they are mathematically 
independent of the particular characteristic of the member whose 'Ordinal 
number is chosen. By mathematically independent I mean that the value of this 
member does not appear in the law of formation, so that the same member would 
be chosen whatever its characteristic. 

Definiticm. If a selector is constructed from a suite and, in virtue of the law 
of formation, any member of the selector is mathematically independent of the 
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characteristic whose ordinal number in the suite is the value of that member, 
the selector is said to be ‘‘disjoint’' with respect to the suite. 

7. It might happen that a law of formation of a disjoint selector was given 
which did not in fact lead to a selector in the case of certain suites. For example, 
with the suite (2), if we try to construct a selector by choosing ordinals corre- 
sponding to the characteristics following three successive Ai’a, no ordinals 
appear. Such a law I should regard as degenerate in relation to that suite, and 
I exclude it from the domain of discussion from this point onwards. Hereafter, 
in speaking of a selector in relation to a given suite I shall assume that the one 
is disjoint with respect to the other. 

8. We may now apply selectors to pick out subsets from a suite. We do so by 
choosing from the suite those members whose ordinals are the numbers appearing 
in the selector. 

Ex hypothesi, the result of this process will be a new suite of the charac- 
teristics (some at least) of the original suite. We may call this a “ Derived suite ”. 
Symbolically, denoting the selector by the roman S and the suite by K, we may 
write D==Sif. (3) 

I proceed to prove one or two theorems of a negative kind about derived 
suites. 

9. A suite derived from a proper suite is not necessarily proper. 

For let K - ..., 

S = 1, 2, 4, 6, 7, 9, 11, 13, 15, 16, 18, 20, 22, ..., 
the numbers running alternately in sets of even and odd, the number of each 
kind being equal to twice the number of preceding members of the selector. 
Then gjf _ Ai^AzA^A^A^A^A^A^A-i^A^A^A^Aj^A^A^ .... 

However far we go in this series, say to the end of a run of A^’s, there will follow 
twice as many A^’s as there have already occurred of both Aj’a and A.^’s. Clearly 
the suite is improper. 

10. If we apply another selector Sj to a derived suite we get a further 
derived suite which we may write SgSj A. It is clear that this will not in general 
be the same as 

The “identical” selector E = 1, 2, 3, 4, ... is of some importance. Clearly it 
reproduces a suite to which it is applied, and ESiA' = SjEA, etc. 

Randomness 

11. Definition. If the probability of the characteristic A^ in a proper suite 
is p ; and if the probability of A^ in a proper suite derived from it by the selector 
S is also p ; then the suite is said to be random for the characteristic Aj with 
respect to S. 

Suites and selectors with this property exist. Every proper suite is random 
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with respect to the identical selector E, and the suite (1) is random for both ^4^ 
and ^2 respect to the selector 

1,4, 7, 10,..., 

though not to the selector 1, 3, 5, 7, .... 

12. Definition. A suite which is random for a characteristic A with respect 
to a number of selectors , S2, . . ., S„, is said to be random in the selector domain 

SnS2,...,S,„. _ _ 

It is to be noted that if a suite is random with respect to and 02 it does not 
follow that S^K is random with respect to Sj or S^K with respect to Si- E.g. if 

K ~ A^A^A^A^Aj^AiA^A^ ..., 

82 = 1 , 3, 5, 7,. 9...., 

§2 = the disjoint selector obtained by writing down the ordinals of charac- 
teristics next following A^, 
then = A^A^Aj^A^A^A ^ ..., 

S 2 E = A-j^A^Ay^A2Aj^A2 . . i , 

so that K is random for and A^ with respect to both 83, and 82. But 

S 2 S 2 -K = A^A^A^A^A^A ^ ..., 

S2S2JE = AiA.^^A.yA^Ay^A.j^ .... 

13. Given a suite and a certain finite set of selectors, we may consider the 
suites obtained by repeated applications of groups of these selectors. This will 
give us a series of derived suites which may be infinite but is nevertheless 
ordered. If all the resulting suites are proper and the probability of a charac- 
teristic A in them all is the same as that in the parent suite, the latter is said to 
be completely random in the selector domain Sj, Sj, ..., S„,. 

There exist suites which are completely random in certain domains. E.g. if 
E — A^A^A^iA^A-^A^ 

Si =1,4, 7, 10,..., 

then Si J: = A^A^AiA^A^^A^ ...==K, 

so that repeated applications of Si lead only back to the original suite. 

Consider further Sg = 2, 6, 8, 11, .... 

Then Sgif = A^AiA^A^A^A^ ... 

SiE = did2did2di4.... 

Thus, any number of applications of Sj and Sg lead either to 4i dgdidgdidg . . ■ 
or to dgdidgdidgdi..., and hence the suite is completely random for di and 
dg with respect to Si and Sg. 

^ It follows at once that any suite which is derived from a completely random 
suite by a selector of the set is also completely random within the same domain. 
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14. The foregoing examples, though trivial enough, show that the various 
ideas which have been introduced are not self-contradictory, and that they fall 
within the scope of ordinary mathematical concepts. But the use of the word 
“random” to describe a state of affairs which is the reverse of what is ordinarily 
understood by the word requires some scholium. I have, for instance, remarked 
that any suite is random with respect to the identical selector 1, 2, 3, 4, .... But 
surely, it may be said, such a suite as AiAiAj^AiAiAiA^Aj^Ai... is the very 
reverse of random, being as systematic as any such series can be ? I will antici- 
pate a later , part of this paper to some extent by a short explanation of this 
point. 

15. In statistical work we require one thing above all in a “random” 
selection ; namely, that if continued long enough it shall draw all members of the 
universe equally often, or at least in a known proportion. In fact, it is not the 
haphazard quality of randomness that we use in drawing inferences from random 
samples, but the only thing about it which is not haphazard, namely its property 
of producing definite limits. (I am, of course, spealdng colloquially.) Any 
method of selection would serve equally well if it satisfied this primary requisite. 
The “random” series with which we are familiar in ordinary work are un- 
purposive and chaotic in appearance for two reasons : firstly, because we fondly 
hope to have a series which gives a suite random in regard to all possible 
selectors, so that it can be used to draw random samples from all universes 
whatever the characteristic under consideration ; such a series must be random 
in a very wide selector domain, including all the more obvious selectors which 
would give the series a purposive appearance, and consequently it looks un- 
systematic ; secondly, as an experimental fact we have learnt that when a sample 
is chosen haphazardly it is often random, at least so nearly so that ordinary in- 
spection of a series of results will not reveal the difference; this haphazard 
selection leads us to expect from it an unpurposive-looking result. 

16. But there is no virtue in lack of purpose for its own sake. In fact, 
random sampling has a very definite purpose, and it is the purposive parts of it 
that we have in mind in using the method at all. As the domain of selectors 
becomes larger, the random suite becomes more and more like the completely 
haphazard entity which von Mises would like to make the basis of his theory ; 
but in my view the random suite must always be considered as random in a 
finite domain. I contend that there is no such thing as absolute randomness, 
just as there is no such thing as absolute velocity. The latter has meaning only 
with reference to a co-ordinate framework, the former only with reference to a 
selector framework. I might summarize the attitude of the foregoing paragraphs 
by saying that they are founded on the concept of the relativity of randomness. 
If this be agreed, the difficulties about terming “random” certain series which 
do not conform to the colloquial use of the word at once disappear. 
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A theory of randomness 
Multi-dimensional suites 

17. As a simple extension of the idea of a suite of characteristics we may con- 
sider suites of sets of characteristics. Such an extension oilers no difficulty, and 
is very similar to the transition from describing points on a line in terms of one 
•co-ordinate to points in a multi-dimensional space by several co-ordinates. 

We may amalgamate two or more suites into a suite of more dimensions. 
E.g. with the suites 

we can associate the mth member of one with the nth member of the other to 
obtain (A^B^) {A^B^) .... 

Convolution 

18. We may also construct an m-dimensional suite by dividing a one- 

dimensional suite into blocks of m. This process is worth noticing. Consider the 
suite. AT^A^A^AiA^AiA^A^A^.... 

This is proper and each characteristic is random with respect to the selector 

1, 3, &, 7, 9, .... 

Now- suppose we make a two-dimensional suite by bracketing successive terms, 

{A,A,){A,A,){A^A,).(AA)-- 

This is proper with respect to the two two-dimensional characteristics 
and (Aj Aj) but it is not random with respect to the selector given. 

Definition, I shall refer to the process of deriving a multi-dimensional suite 
from a one-dimensional suite hy grouping sets of successive terms as “convolu- 
tion”, and the derived suitewiU be said to be “ convoluted”. Erom the example 
given it is clear that a convoluted suite is not necessarily random in the domain 
of randomness of the parent suite. 

Independence 

19. Definition. If a two-dimensional suite is derived from two one-dimen- 
sional suites by attaching one member of the first to the member with the same 
ordinal number in the second ; if the original suites and the new suite are proper ; 
and if the probability' in the derived suite of a characteristic [A^Bf,) is the pro- 
duct of the probabilities of Aj in the first suite and of 5* in the second for all j 
and fc; then the two original suites are said to be statistically independent. 

Definition. If from a proper two-dimensional suite there are derived two 
proper one-dimensional suites by ignoring the first and then the second charac- 
teristic of the pairs which constitute the suite; and if these two suites are 
statistically independent, the two sets of characteristics are said to be statistically 
independent in the original suite. 
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Statistical independence as thus defined concerns either suites or charac- 
teristics in suites. Like probability it is a property of aggregates, not of indi- 
viduals. 

20. The generalizations of statistical independence to the case of several 
suites or multi-dimensional suites can be made without difficulty. I shall here 
omit them and the theorems which they obey, since all the results are obvious 
extensions of the theory of class frequencies set out for the case of finite classes 
in the Introduction by Udny Yule and myseK (1939). The following results are, 
however, worth recalling; 

(a) If K, L, M are proper suites, K is statistically independent of L and 
L is statistically independent of if; it does not follow that K is statistically 
independent of M. 

(b) Three suites are statistically independent only if the probabilities 
(AjB^.Gi) are equal to the product of the probabilities Aj in K, in L, and Oi 
in if. It is not sufficient that they should be statistically independent pair and 
pair, as the following example shows: 

K = •* *> 

L = Bj^B^B^B^BiB^B^Bi..., 
M^C^C^C^G^C^C^C^O^.... 

Here, for instance, the probability of (AiB^G^) is zero in the suite obtained by 
associating triads from miembers of the suites which have the same ordinal. 

Local randomness 

21. A suite as defined above is infinite. I now consider a finite series of 
characteristics which I call a sequence. A sequence may be considered as a 
section of a suite. 

It is evident that any sequence, being finite, can form part of a suite in 
which the probabilities have any given value and which is random in any as- 
signed domain. We may, however, imagine the selectors of the domain applied 
to the sequence, that part of the selectors which contains numbers greater than 
the number of members in the sequence being ignored. Similarly, we can 
convolute the sequence in any way consistent with its size and apply selectors 
to the sequences so derived. We can compare the actual proportions of charac- 
teristics in these sequences with those in any given suite. Any such process 
I call a test. 

Definition. If the proportional frequencies in a sequence are approximately 
what they would be in a suite random with respect to the selectors of the test, 
the sequence is said to be locally random with respect to that test ; and so for a 
test domain. 

To make this definition precise it is necessary to consider what is meant by 
‘•approximately”. Suppose the sequence is of size m, and consider the r” possible 
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sequences of this size. To each there will correspond a proportional frequency 
under the tests. Choose a number of these, ar”, which may be regarded as 
“approximately” the same as the proportional frequency in the suite. Then if 
the given sequence is one of these, it is “approximately” the same. Clearly the 
word approximately depends on the choice of the number cc which corresponds 
to what is generally known as a “level of significance”. 

The concept of local randomness, in my view, is important. The series of 
characteristics which we encounter in real life are always sequences, not suites ; 
and we have to estimate probabilities and random properties from finite aggre- 
gates, not from infinite series such as form the basis of the theory. Any finite 
series of characteristics whatever is random in the sense that it might arise, how- 
ever infrequently, in. random sampling. But in order to make any practical use 
of our theory we have to consider certain series as non-random, or in other words 
we have to judge from the local randomness of observed sequences. 

Part 2. Application of the theory to practice 
Events 

22. Events are the primary data of statistical experience. Every event has 
a number of properties, the conceptual abstractions of the Gestalts which it 
provides. These properties may be called characteristics, and it is with aggre- 
gates of characteristics that statistical inference is concerned. The throwing of 
a die and the growing of a crop on a given field are events. Characteristics of the 
former would include the number which came uppermost, the time at which the 
throw was made, the angles which the edges made with a line fixed in space, and 
so on. In general an event has an infinite number of characteristics. When we 
have a complex phenomenon such as a crop of wheat on a field it is a matter of 
choice whether we regard the whole thing as one event, or look on it as a series 
of associated events, e.g. a collection of crops on a number of square yards. But 
the event is to be regarded as including the whole of the happening, and is not 
synonymous with one of its characteristics. A yield of wheat is not an event, 
nor is the number thrown by a die. 

23. Consider then an aggregate of events. Suppose there exists a finite set 
of characteristics such that each event has one and only one characteristic of the 
set; for example, the events consisting of throws of an ordinary die must have 
one of the characteristics 1-6, according to the mimber which falls uppermost, 
and cannot have more than one. We can then say that the aggregate of events 
gives rise to an aggregate of characteristics. 

24. Now the aggregates of events we meet in experience are always finite. 
It is true that we sometimes regard a line as composed of an infinite number of 
points or a solid body as an infinite number of particles; but these are mental 
fictions and it is not possible to observe the characteristics of an infinite number 
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of events. The finite aggregates of our experience can be ordered to produce a 
sequence — in fact they are usually arranged for us by the temporal order in 
which they occur. The fundamental problem of linking theory and practice— 
and an analogous problem arises in all frequency theories of probability — is to 
relate the sequences of observation with the suites of theory. 

25. The sequences of observation may be regarded as generated by a 

physical process. The sequence consisting of throws of a die, for example, may 
be considered as defined by the rules under which the die is cast. A sequence of 
crop yields is determined by the circumstances under which the crop was grown. 
I shall assume that the physical process generating a sequence can be depicted 
by a mathematical law defining a suite, in the same way that the “straight 
lines” we draw on paper can be depicted by the straight lines of Euclidean 
geometry, or a rigid body by the abstractions of mathematical dynamics. 
Members of a sequence are ascertained by experiment; those of a suite by 
calculation. ' 

26. I also take it as empirically established that there are observational 
sequences which can be adequately described as locally random sections of 
suites ; random, that is, in certain domains. And I assume that the processes 
generating these sequences will, if continued, produce further sequences which 
are also locally random. This is essential to all scientific inquiry, that a law 
which is established will continue to operate. If it does not, we must alter the 
law; but before carrying out the extra trials we can only act on the assumption 
that the law will hold. Put in this way, perhaps, the assumption seems un- 
justified, but it is made every moment of our lives. In writing these words I am 
assuming that a past phenomenon will recur, namely that a particular arrange- 
ment of marks on a piece of paper will evoke certain ideas in the reader’s mind. 
This much I am compelled to concede to those writers, like Mr Keynes and Dr 
Jeffreys, who contend that probability cannot be defined in terms of frequency, 
namely that the uncertain attitude of mind which one adopts towards some 
laws cannot be measured by probability as I have defined it. As to the phe- 
nomenon of mental doubt, the scientific procedure by which it is removed or 
strengthened, the desirability of measuring it, I am in agreement with Dr 
Jeffreys; but I do not call the measure probability in a statistical context. This 
is only a matter of words, but unfortunately so is a great deal of statistical 
discussion. 

27. The probability of a characteristic in a physical process is to be estimated 
from the observed sequence to which the process gives rise. I need not dwell 
here on the methods of estimation and the ideas which underlie them. But there 
is one point of some importance to note. It is evident that probability is a 
property of characteristics, not of events, and to be strictly accurate we should 
always speak of it as such. For instance, if I toss a penny on to a chessboard, the 
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limit of the proportion of heads may be but the limit of the proportion, of cases 
in which it falls on a white square may be Neither of these fractions is the 
probability of the event. They are probabilities of characteristics and there is 
nothing inconsistent in the fact that they are different. 

Independence 

28. The statistical independence of two suites, or of characteristics in a 
multi-dimensional suite, was defined in paragraph 19, and the statistical inde- 
pendence of observed sequences follows the same line. In statistics the question 
whether two series of characteristics are independent is to be determined purely 
from the experimental data. There may be very good reasons why one ohafao- 
teristio is “dependent” on another in a causal sense, but if the occurrence of one 
is not accompanied by the occurrence of the other in “unexpected” proportion 
they are statistically independent. Contrariwise, there may be no obvious causal 
nexus and yet the two may be statistically dependent. In fact, I would be in- 
clined to deny any separate meaning to “causal” dependence other than that of 
statistical dependence (with perhaps, allowance for the temporal element) . 

29. This point is important in one respect. I have up to the present spoken 
only of the independence of characteristics, not of events, and even of the former 
only in terms of suites or sequences. But in the theory of probability as ex- 
pounded in textbooks it is quite common to meet with such expressions as “a 
series of independent events”, or “successive events are independent”. The 
word “event” here means what I call a characteristic; but can we speak of “a 
suite of independent characteristics”! I do not think so. In my opinion the 
concept is equivalent to that of the Irregular KoUektiv of von Mises. 

30. For example, one would be inclined to begin an approach to a definition 
of the concept by requiring that each characteristic was followed equally fre- 
quently by all characteristics of the suite, e.g. that in a suite of characteristics 

an A^^ was followed equally frequently by an or an A^. But this is true 
of the suite 

which is clearly not of. the, type desired. One might then require that each 
characteristic should, in addition, be followed next but one by all other charac- 
teristics in equal amount. But this is true of the suite consisting of repetitions of 

AiA^A^A^AiAiAiA^.... 

Baffled by continual examples of this kind, one might then require that the 
occurrence of any characteristic was to be independent of all or any of the 
characteristics which have preceded it. This, on analysis, is found to be equi- 
valent to the requirement that the suite shall be random with respect to all dis- 
joint selectors of the type considered in paragraph 6; and this is precisely the 
difficulty of the Irregular KoUektiv. 
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31. I can see no way round this difficulty; and I therefore reject the suite of 
independent events as I reject the Irregular feollektiv. This has two important 
consequences, the first concerning the ordinary theorems of probability and the 
second concerning Bernoulli’s theorem. 

The first point may best be illustrated by an example : suppose the proba- 
bility of getting a head with a toss of a penny is |. What is the probability of 
getting two heads with consecutive tosses? Anyone grounded in the classical 
theory would answer “ without hesitation. Nevertheless, the result is only 
true under certain conditions. In fact the data of the problem are that there is 
a suite of throws of the penny and that the proportional frequency of heads in 
this suite is Now such a suite might be = heads, = tails) 

and the probability of getting two successive heads is zero. But, it may be ob- 
jected, this is an artificial series which would never occur. To this I should reply, 
Eigreed, but why should there not occur a natural series in which the proportional 
frequency of pairs of heads did not tend to J? It will, I hope, be clear on re- 
flection that there is nothing in the data of the problein to require the answer J 
as' a logical necessity unless we make some additional assumption such as this : 
the occurrence of one characteristic is statistically independent of the occurrence 
of the next. This contains the answer of J implicitly. 

32. In generalization of this problem we might ask: if the probability of the 
characteristic is p, what is the probabihty that in a set of n characteristics we 
shall get r successes and n — r failures? Here, again, the answer of the classical 

theory would be P*" (1 — and here again the result is only true if we 

assume the statistical independence of sets of n. Clearly if the result is to be 
true for all n wp are once more verging on the suite of independent charac- 
teristics referred to in paragraph 30. I conclude that for statistical purposes the 
results of the classical theory of probability are not to be accepted without 
examination. If in any particular case we require one of these results, we must 
be satisfied that the suite we are considering is such as to justify the use of it. 

Bernoulli's theorem 

33. The well-knowil theorem given by James Bernoulli in the Ars Goniectandi 
is subject to similar limitations in regard to its statistical applications. In 
essence the theorem is a proposition in algebra which may be stated thus : in the 
binomial distribution (p + q)'^, if u be the sum of the greatest term and the n 
preceding and n succeeding terms, the ratio of u to the sum of the remaining 
terms may be made as large as we please by increasing n sufficiently. There can 
be no criticism of this result. But, as applied to statistical series, the theorem 
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states that if the probability of a characteristic i« p and we observe m sets of n 
events, the pr( 3 portion of seta in which the i»r(!p(jrtion of smajcsseH differs from 
p by less than e tends to unity with large m and large n. Or put another way, 
the probability that the proportion of sucoeases in a set of m differs from p by 
less than e tends to unity as rn tejids to infinity. Symbolically, 

P{\p-h{A)\< e} > 1 - J/, wi > M, 
where li(A) is the proportion of successes. 

34. This is a statement about the probability of a probability and 1 need not 
emphasize the logical weakness inherent in it. On looking into the pnjposition 
further we find that it is dependent on the type of assumption considered in 
paragraph 32, namely, that if the probability of a ebaracderistic i.s p, the proba- 
bility of r characteristics in a set of n is p' (1 

This will only be true if the observed series is, approximaf (dy at least, a series 
of “independent” characteristics. Consider, for example, the suite 

the characteristics of which are random with respes’t to the stdeefor 

1, 3, 5,7,9, ... 

and have probabilities each equal to J. Consider the convoluted .suite 

{A,A{} {A,A,) {A, A,) (A., A,). 

Bernoulli’s theorem, as usually stated, would lead us to the ('(mclusion that the 
probability of getting in this suite a pair containing one and one is .J. 
Actually it is zero. 

If, once again, it is objected that this is a highly artificial .series, I reply as 
before that series with the same properties might arise naturally. We can only 
make Bernoulli’s theorem legitimate by postulating that the suite to whiidi it 
is applied shall have the property of randomness under convolution for all n. 
I do not think tliis is always a legitimate hypothesis to make, l)ut 1 am anxious 
not to be misunderstood on the point. There undoubtedly exist Moquenoc.s which 
can be regarded as belonging to suites random in a very wide domain- so wide 
that for many practical purposes they can be taken to be seciueiu^es of "inde- 
pendent” events. The point to be stressed is that this assumption nnderlic.s a 
great deal of statistical work but is never brought to light, and ituloed is often 
not realized. The statistician takes it for granted ; but to the philosopher mjthing 
is more surprising than the orderly disorder which is common in Nature. 

Random sampling 

35. A statistical universe is an aggregate of objects, which may he finite or 
infinite. I consider selective processes which consist of abstracting one member 
at a time from this universe, and I suppose that each member is returned to the 
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universe after drawing if the universe is finite. The abstraction of each member 
may then be considered as an event, whose characteristics may be noted. 
I assume that there exists for these events a set of characteristics such that each 
event must bear one characteristic and can bear only one. 

We may then imagine this selective process, which I shall call sampling, as a 
generator of a sequence of any desired extent, and to be capable of continuation 
without limit. The result of unterminating sampling would be to give a suite of 
events, to which there correspond one or more suites of characteristics. In 
practice we shall have a sequence only. 

36. Definition. If a sequence obtained from a universe V by sampling is 
locally random for a characteristic A within a selector domain D, then the 
sampling is said to be random for U with respect to A within the domain Z>; 
and any member of the sequence is called a random sample for the characteristic 
A in the domain D. 

This definition brings out the extremely relative nature of random sampling. 
A method which is random for one universe may not be random for another; a 
method random for one characteristic may not be random for another,, even in 
the same universe; and the randomness is always relative to the selector domain. 
It is also to be noticed that the sampling process, being physical, can only be 
related to sequences, not to suites. 

The assumption we make in using a random sampling method is that if it has 
in the past generated locally random sequences it will continue to do so in 
similar circumstances. The justification for this assumption is empirical. 

37. In practice we sometimes draw samples one at a time and so obtain a 
one-dimensional sequence. We then convolute this sequence into groups of n, 
making an n-dimensional sequence. But we may also draw the samples in a 
block of n (which I shall call a “clutch”). The difference is of some importance. 
If we ignore the order of the individuals in a convoluted sequence we have what 
is virtually a clutch, and it 'is very common in statistical work to ignore the order 
in this way. A series of sampling results, for instance, are frequently given with- 
out any indication of the order in which the individual results appeared. It 
should not be overlooked that certain information relevant to the randomness 
of the sample has disappeared in the process. For example, we may be told that 
in a sample of 1000 births 610 were male. We should probably conclude that there 
was nothing in the sample to show that it wa? not random. But if we know that 
the first 510 wera-male we should certainly conclude that it was not. 

38. What are the grounds on which a selective process of experience is con- 
sidered to give random samples? In the first place, as Ms already been re- 
marked, we can only use a selective process to produce finite sequence. This 
sequence is always locally random in some domain or (fiHier. If we find that, as 
the sequence is increased, local randomness is maintsjfined, we may say that the 
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method is random for the imiverse and characteristic considered. But we require 
more for a method to be used in practice. We require to be able to suppose with 
some confidence a iJfion that it will be random for fresh inquiries, fresh universes, 
fresh characteristics and fresh sequences. And we require the domain of random- 
ness to be as wide as possibie. It was formerly the custom to assume randomness 
to any desired extent if there was no obvious reason to the contrary — a sort of 
Principle of Non-sufficient Reason. This is most unsatisfactory, and could only 
be justified if it was found in practice that haphazard methods of selection give 
locally random sequences. In fact we find that whenever any element of 
personal choice is allowed free play, bias is very liable to appear. 

39, It seems to me that we can never rid ourselves entirely of the possibility 
that a method of selection may lack randomness; but we can safeguard against 
the possibility to a great extent. For instance, the method of Random Sampling 
Numbers appUed to a universe of names in a directory gives us something near 
certainty (if I may be allowed that colloquial expression) that the resulting 
samples will be random. Furthermore, we can experiment with a method to see 
if bias has appeared. If it has not, we are justified in expecting that it is random 
for the class of cases in which it has been tried. Ultimately, however, the 
assumption of randomness is part of the hypothesis which is being tested. 

40. An assumption which is usually made in practice is that the method is 
random within whatever domain happens to suit the investigator at the 
moment. One draws a random sample from the universe of inhabitants of the 
British Isles. One says that the sample is “random” without any qualification. 
Behind this lies the assumption of the Irregular Kollektiv which has been con- 
sidered from a different angle in paragraph 30. A great many statisticians would 
use such a sample to test any hypothesis about the universe which they chanced 
to encounter; they would assume that it was random in regard to height, sex, 
age or any other characteristic ; they would assume that it was random under 
convolution; and they would assume the legitimacy of testing in any sampling 
distribution which happened to be convenient. All of this amounts to an 
assumption of randomness in a very wide domain, depending on a subjective 
judgment which may be quite wrong. The wider the domain, the less likely 
(again speaking colloquially) is the assumption to be justified. In practice this 
assumption frequently has to be made, and can be made without much danger 
with a good sampling method. But the greatest danger lies in the fact that the 
person making the assumption very often does not even realize that he is doing 
so. In any sampling inquiry it is necessary to ask oneself. Is the sampling method 
I am using random fqr the universe I am considering, for the characteristics I am 
discussing, and for the^ sampling distributions or tests of significance I am em- 
ploying! Randomness i|^relative. 
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A GEOMETRICAL ANALYSIS OR THE FREQUENCY 
DISTRIBUTION OF THE RATIO BETWEEN TWO 

VARIABLES 


By C. NICHOLSON, M.G., M.A., M.D. 

This subject has already been tceated by Geary (1930), and by FieUer (1932), the 
approach to the problem in both cases being algebraical; the geometrical approach 
to the same problem suggested itself to me when I was working on a series of 
anatomical measurements (Nicholson, 1938). This paper could hardly have 
reached publication without the generous assistance of Mr N. L. Johnson of 
University College, London. 


(1) Variables indepehdent 
We are to consider the distribution of the ratio 


a: + A’ 


where Y and X are constants, and the joint distribution of x and y is given by the 
normal bivariate surface 

1 r lfa:2 yi]-\ 

* J’ 

Then if we refer our observed values {y+ Y] and (x-{-X) to the axes ?/ = 0 and 
* = Ojjfche co-ordinates of the intersection of the zero values of the variables will 
be —X and - T, since a;-t-A = 0 when x = -~X. The ordinates for a constant 
value of the ratio will lie in a plane surface passing through this point of the 
general form 

y~mx + c, 


where m will have the value of the ratio, and c will be ml - f. The equation to 
the projection of the section of the normal surface by this plane on the (x, z) 

1 in V ' ^ 


plane will be 
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which can be rearranged to 
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The area of this section is the integration of z from — oo to + 00 , the variable 
(because of the change of angle) being x^{l+m^), and this is 




V'(27r)V(mV2+o-p 2W{in^ 


expf-^ 


ri 

M + O"?)/ J 


( 2 ) 


The quantity ■ 


• in (2) is equal to 


mx-r 


7 — aT -- \-i T, -^2-, — > which is the function of 

V(mV| + a-2) u ^(m^crl + al) 

the ratio which Geary treats as t. Now m is the tangent of an angle, say /?, and (2) 

may be put in the form 


1 r 1 I CCOBjS 

^(277-) ^( 0-2 sin® /? + cr* cos® /?) L 2(v'(o"|sin®/?+o-® cos®/?) 

Here c cos P is the perpendicular distance from the origin of the surface to the 
plane y = mx + c, so that we can draw the conclusion that a series of parallel plane 
sections of a normal surface making an angle of p with the x axis are normal 
curves with a standard deviation of 





^(cr| sin^yff + cr® cos^P) ’ 

while their areas form another normal curve -with the variable c cos p and with a 
standard deviation of ^(cr2sin2/?+cr2cosV)- 


(2) Vaeiablbs oorbblatbd 

Clearly, in the case where the primary variables are not independent, it is still 
possible to reduce the distribution to this same geometrical system, so that we 
may discard reference to the primary variables and refer rather to the principal 
axes of the surface generalizing (3) as 

1 

^(27t) sin* (a + 6) + 6* cos* (a + 6)) 

r M n'i 

2\V(a*sin*(a + ^) + 6*cos*(a + 0))j J’ ^ ^ 
where, referring to Fig. 1, a and b are the standard deviations in the direction of 
the major and minor axes respectively of a normal surface; also k is the distance 
of a point K from the origin, K being the focus of a pencil of planes cutting the 
surface, and k being equal to + F*). The angle KOA. is a, which is the absolute 

value of the angle less than between the major axis of the surface and the line 
joining the origin to the intersection of the zero value of the variables; it is 

|tan-i(r/X)-(y|, 

where S is the angle between the x axis and the major axis which is given by the 

6 is the angular deviation of any plane from the angle a, which is taken as the 
origin of the pencil, and the value of any ratio will be given by tan (a + 5+0). 

Biometrika xxxii , 
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Frequency distribution of a ratio 

It is clear that if we were to confine ourselves to the distribution of the ratio 
we must use tan (a + ^ + (9) as the variable, but in this generalized form it is more 
informative to use the angle itself as the variable. The cumulative frequency for a 
deviate 6 is the content of the two plane angles PKT and P'KT' , and where K lies 
without the bulk of the distribution the content of the angle PKT will be negli- 
gible, The content of the angle P'KT' may then be taken as the content between 



Pig. 1. Projection, of normal bivariate surface to illustrate the geometry of 
the ratio between variables. 


the two ])arallel planes, TKT' and SOS', the latter passing through the origin of 
the surface, and this is 

1 

<J{27r) ain* 6® cos® (a + 6 )) 

/•fcslnS r 1 ( u I®”! 

Jo 2|y'{a®8in®{(Z + 0)-|-6®oos®(a-f (9))| _ 

If in (5) we put as t, we get 

^/(ti®sin®((X-l-o)-f<)®cos®(a-{-0)) ® 

feeing 

1 j'V(o'sln*(a+0)+6’oo8Ma+e)) 

where, if the variables are independent 

ccos (a + (9) _ mX~Y 

sin® (a + d) + 6® cos® (x+d)) sin® (a + 0) -t 6® cos® (a -f 0)) ~ ^7(mV® + cr®) ’ 

80 that (5) is identical with Geary’s formula. 
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Continuing to regard d as the variable, the equation to the curve for the 
frequency distribution given by ( 5 ) is the moment of the area ( 4) about if , and this 
can be arrived at either geometrically or by differentiating (5) with regard to d\ 
it is 

_ 1 sing sin (a + ^) + 6^ cos a cos (a: 4- ^))) 

^ ^J{2n){ (a^ sin® (a + ^) + 6® cos* (a + 1 


1 j ksinff 

^ ^ 21 v/la^sin® (a + ^) + cos® (a + 0)) 


( 6 ) 


Here the numerator of the factor within brackets may be put in the form 

A sin® a + 6* cos® a) sin (a + y + 0), (7) 

where y is the absolute value of the angle which the axis conjugate to POP' makes 
with the major axis of the ellipse, i.e. where tany = (b^la'^) cot a. It is thus seen 
that the curve is limited, the range of 6 being from — (a + y ) to 77 — (a + y ) ; at these 
angles the value of y is zero. 

It should be noted that the content between the planes 808' and TKT' is 
equal to the content of the angle P'KT' minus the content of the angle PKT, so 
that in the integration the value of the content of the angle PK T is twice neglected . 
It follows that if we integrate (6) between the limiting values of (? the total amount 
neglected is twice that part of the surface which lies beyond a plane QKO' which 
passes through and makes an angle of y with the major axis, the value is 


2 

y(2;r) J fc/(r„ 

where cr^ is the standard deviation of the normal curve given by a plane section 
of the normal surface which makes an angle of a with the major axis, i.e. where 


_ .ab 

* sin® a + h® cos® a) ' 


(3) Curves given by this equation 

The curves generated by this equation are of very great variety, the majority 
being bell-shaped, and we may now discuss the effects produced by varying the 
constants. It should be noted that, as POP' bisects the normal surface, the origin 
of the curve is neither the mean nor the mode but the median. 

“k" may have any value from or 4a-„ up to infinity; as k tends to infinity 
the distribution of the standardized variable 6 tends to normality with a standard 
deviation of 

^ -^(a® sin® (X -i- 6® cos® a). 

As k decreases in value the departure from the form of the normal curve becomes 
more marked. 
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Frequency distribution of a ratio 

“alb” may have any value from unity to infinity. When the value is unity the 
curve is very similar in form to the normal curve; as the value of ajb increases 
departure from the form of the normal curve increases. 

On the value of a the symmetry of the curve depends; a may have any value 
between 0 and ^tt. At zero the curve is symmetrical but steeper than the normal 
curve. As a increases asymmetry develops (asymmetry being taken to mean the 
excentrioity of the median), the maximum excentricity being reached at a value 
of tan-^ (6/a), thereafter the curve returns gradually to symmetry at a value of 



. Fig, 2. Cum from equation (0). ConstantB: o=4, 6 = 1, A=;3of,„ a=80°. 

where it is flatter than the normal curve. Skewness develops with asymmetry 
but more slowly, and the maximum skewness is not reached until a has a value of 
irr. , 

If k is relatively small, i.e. is near 3o-„, and a/6 is large, and if a is near ^rr, we 
get a curve with a maximum value on each side of the median, symmetrical when 
a is \7r. This distribution does occasionally arise in practice; an example is given 
by Udny Yule (1932). Mg. 2 is an example of a slightly asymmetrical curve of 
this type. 

(4) A GEN-EKAL SOLUTIOK 

K may very well occupy a position within the bulk of the surface; this must 
happen when both of the variables have negative values If we can obtain an 
equation of the curve in this case, it should have a general application to all values 
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of h. Geary stated this problem but did not proceed to its solution, Fieller gave a 
general algebraical solution. 

We may consider K as lying on the circumference of the ellipse 


where h==kjcr^ and the ordinate on the circumference of the ellipse is 


1 

27ra6 




In terms of the primary variables, 

- 1 ^rXY F2\ 

~ cr^(Ty a%)' 

As before, the frequency curve of the angle 6 is given by the positive moment of the 
normal curve (4) about the ordinate at K. This moment may be considered in two 
parts, the moment arising from the portion of the curve without the ellipse (A), 
and the moment arising from the portion of the curve within the ellipse on which 
K hes (B). 

(A) For any normal curve the moment about the origin for that part of the 
curve beyond the ordinate at a given deviation her is 


which is equal to 




h<r 

yoCr2e-i^‘. 


That is to say the moment is a function of the ordinate at the given deviation and 
of the standard deviation. Moreover the moment about this ordinate of the two 
equal tails of the curve is equal to the moment of these tails about the origin of the 
curve, so that in our case the required moment is 


e-Vt' ab 

n a* sin® (a + 0) + 6® cos® (a + d) ' 


( 8 ) 


(B) The length of a chord of the ellipse which passes through K and makes an 
angle of a + 0 with the major axis is 


2hab 


i/(a® sin® (a + d) + 6® cos® (a + d) ) 




ksmd 


sin® (a + 6) + b^ cos* {oc + 6)) 
ksiad 


( 9 ) 


( 10 ) 


and if we make sinci = ^ ^ 

^ A-y(a®sin®{a + d) + 6®cos®(a+(9)) 

we may put (9) as 2h ooa<f>a'^^+gy The total area of the normal curve in this plane 
using (4) is j 

^( 2 tt) sin® (a + 0) + 6® cos® (a + d)') 

so that the ordinate at its origin is 

g-it(A Bln 


e-KAslu!/l)‘ 


2nab 
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and. the area of the portion within the ellipse is 


mb 


-KftslnjS)’ 


1 009 ,. 8 ) 
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exp 


~ ' 1 / V 
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This is multiplied by h oos^o'j^.^a), 

1 1 ‘h cos ii 

sin ^)*crfa4.8)fe COB^J e~i“’d! 4 , 

which may be simplified uito 


adding (8) and (12a), we have 




y 


ab 


•n a®sin®(a+0) + 6*cos^(a+d) 


( 11 ) 


( 12 ) 


+ry-g(fecos^)« + r3--5-;7(^‘!os^)»+... ; (12a) 


1 + (fe cos?!)® + — {k cos ^)* 
1 » 0 




as the equation of the curve. This expression converges quite rapidly for values of 
h up to 3; thereafter convergence becomes very slow indeed, 

Reverting to (10), the value of sin^ may be put in this form 

at sin ^ 

^(a® sin® a + 6® cos® a)f{a? sin® (a + ^) + 6® cos®" (a + 0 ) ) ‘ 

From this the following identities may be established: 

a®sinasin(«+0) + 6®cosacos(a+0) 

^ f(a^ sin® a + 6® cos® a)f(a^ sin® (a + 0) + 1® cos® (a + S)) ’ 

_ cd) 

dd a®sin®(a+0)+t®cos®(a+0)’ 

^ = tan~® {(a/6) tan (a + 6)} - tan~^ {(a/6) tan a}. 

It should be noted that the .limit of the integral in (6 a) is ft sin ?! and that the value 
of ^ in (16) makes the practical work of calculating a series of these limits very 
simple. If the distribution of 6 is still to be regarded as round the median at a, 
9 will have the limits as before, - (a 4- y) and tt - (a +■ y ) , and at these limits ?! will 
he generally, however, 9 may be regarded as having a range of n beginning 
at any value, in which case ?! will have the same range but with a different 
distribution. 

A geometrical construction to show the relationship between 6 and (p is given 
in Fig. 3, where OA and OB are equal to a and 6 respectively, and O'AB is a 
right-angled isosceles triangle on AB as hypotenuse. 


( 10 a) 

(14) 

(16) 

(16) 
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Here 


AP = 


aBuia 
sin OP jI 


and 
so that 

by the same reasoning 

similarly 

therefore 


6ain(^7r-a) 
sinOPP ’ 

APjPP =? (a/6) tana; 

APjPB = tan AO' P, 

AO'Q = tan-J-{(a/6)tan(a + (9)}; 
PO'Q = ^. 


This relationship between 6 and ^ shows that the solution consists essentially 
in referring an asymmetrical system to an equivalent system where the standard 
deviations are equal. 



If we now turn back to (6), this can be put in the form 


,, = 1 Aa6 sin a sin (a + (9 >+ W cos a cos (a + g) ) ,,, 

^ Ay(27T) (a^sin®a + 62cos®a)i(a^sin2(a + <9) + 6^cos^(a + ^))* ’ ^ ' 

so that we have the approximation of (6) to (13) 


caooh^ \ 
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xpressi 
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It will be seen that the last expression may be written 
d6 e-W 


• h cos^ °°® ' 


*CC 

')■ 

Jh 


ooa iji 


eri^'du). (19) 


The difference between the values of y given by the two equations (if we do not take 
into account the value of d^/dd) is given in the table below for different values 
of h and for <j> == \n and 0, showing that (6) must be a poor approximation to (13) 
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when h is leas than 3, but that beyond that figure the dififerenee rapidly becomes 
negligible. ' 


h 

Difference between (6) and (13) | 

d>"i7r 

((S=0 

1 

0-193666 

0-066476 

2 

0-043084 

0-006776 

3 

0-003536 

0-000306 

4 

• 0-000107 

0-000006 


Fieller’s general solution of the problem is given in his formula (24) 
1 or^<Ty,J{l~r^) r I I X y , 

~7rtr§-2rw^(r„+i;2cr2®’‘^L. 21-r«\o-| ^ (xy\ 


+ exp .--5 


{y~vxf 


2 <r* — irw^cr^ + t^crl_ 


|] 


gyCtT/ft— £g’»)+w«(>^g'>— ypit) . 

<ry{ry(T^~X(Ty) + vcr^{rx<Ty-yaf^ f <^»o-y((i~r*) j„, , 

n((rl-2rvo■^cr^ + ^^(rl)i jo 

V is the value of the ratio under consideration which may be put as tan (a + ^ + ^) , 
and if we reduce TieUer’s formula with its symbols based on the co-ordinates of 
the primary variables to the system with its symbols based on the principal axes, 
- we get the following identities: 

aji-^^rvar^a^ + v^al = (1 -f«*)(£i2sin*(a + ^) + 6®co8®(«-t-^)), 


l_r2= (o'*6“)/(tr|cr5), 

2 ^ y I sin^g + cos^a) 

(r| cr® (Tlal ' 


{y - vx)^ = (1 -f «*) fc® sin® 9, 


~ + Wa,(ra:<ry - y<T^) == k f (I + i^) 

X (a® sin a sin (a + ^) -i- 6® cosa cos (a ■+• i9)), 
so that Keller’s formula as a whole reduces to 


^ nl + v^dT d 9 n{l + v^)jo 

Here d^jdd = 1 -ft)®, so that the distribution ^{v] of v given by Keller’s equation 
is equivalent to the distribution of (9 in the sum of (8) and (12), . 

The curves arising from (13) are of very great variety of form. They are all 
limited, indeed they are more properly described as cyclical, the ordinates at the 
limits for dj since <f) then equals T ^n, being both equal to 


e a*sin®a + b*coa®a 

TT a6 (a® sin® a-H6®cos®a)‘ 


( 20 ) 
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That being so there is no necessity to regard the curve as beginning and ending at 
the limiting values for d\ in practice it would probably be taken to begin either 
where a + (9 is zero or where the ratio under consideration has a value of - oo, i.e. 
where a + 5 + 0 is The tmit deviation in both equations (6) and (13) is the 
radian; for practical work Ttjn would probably be used which would require a 
. corresponding alteration in the equations. If we continue to regard the curve as 




Kg. 4. your curves from equations (13) to illustrate the change in form as h inoroasca 
fromOto 3. The curves oommonoo where ot4-0=O. Constants: a = 2, h~ 1, as=46“. 


drawn with the median at a, we find that when li is zero it is U-shaped except when 
a is small, and is more or less asymmetrical depending on tlie value of a. Witli 
values of Ti about 3 it is possible to produce curves of a highly asymmetrical type ; 
as h increases beyond 3 the curve reverts to the hell-shape and the limits recede 
from the bulk of the curve until, as we saw, when Ti is infinite the curve becomes 
the normal curve and is unlimited. Fig. 4 is an example of the development of the 
curve as h increases from zero to 3. 
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(6) iHTBQRATIOir 

The value of a frequerujy is the integration of y in (13) with respect to the 
variable d and, making use of (16), this is 

p-ih‘ ri 1 

== _^J h + 008^4)2+ j-g {hcoB^Y 

+y|^(/icob^) 8+ j-g^(/icoa9i)8+...|#. (21) 

An expression for this integral may be obtained by converting the series in 
powers of cos 6 into a series of cosines of multiples of ({>, integration then gives a 
series in sines of multiples of <j)‘, but the functions of h which are the factors in the 
series are very complicated and do not lend themselves to easy computation so 
that it is better to reconvert into a series in powers of cos and (putting m for 
P®) this is 

Q(^) == ^ - e-"!) + 1(1 - ~ e-«TO) cos® «;4 

+ |l - er”^ - e~”*m — j eos!^<j> 

+ g g - J 1 - e-'^ -e-«m - e-«* gj - e"”* I cos® ^ (22) 
where the occurrence of the terms of Poisson’s series is very interesting. 

(6) COKCLUSION 

The frequency distribution of the quotient of two normal variables may, then, 
give rise to most of the forms which are met with in statistical work. It is not, 
however, suggested that such statistical distributions always arise in this precise 
fashion; at the same time, from geometrical considerations, it seems likely that 
the product of two variables would produce a similar set of curves. It is not 
impossible that a large number of primary variables might group themselves into 
two secondary variables of approximately normal distribution and that the final 
distribution is some function involving either the quotient or the product of these 
variables. However that may be, the fitting of this curve to any given distribution 
appears to present many difaculties and is quite beyond the scope of this paper. 

(7) Example 

The following example illustrates the practical use of ^ in the application of 
Geary’s approximation. Some of the difficulties of childbirth are undoubtedly due 
to a disproportion between the size of the foetal head and the size of the bony 
opening through which it has to pass, the brim of the pelvis. This difficulty 
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becomes absolute, i.e. demands caesarian section, in about 1 % of all cases (in 
Guy’s Hospital (1937) ten cases were dealt with by caesarian section on account 
of disproportion out of 990 pregnancies in a fair sample of the population); it 
would be well to know for the purjroses of prognosis the percentage ratio between 
the size of the passenger and the size of the passage beyond which spontaneous 
delivery becomes impossible, The foetal head, as it passes, is roughly circular in 
section and the area of the maximum section may be calculated from the bijiarietal 
diameter; the figures for this diameter are taken from a series of 1010 measure- 
ments by Ince ( 1 939) . It has been shown that the area of the pelvic opening is given 
to a close enough approximation by the area of the ellipse on its antero-posterior 
(conjugate) and transverse diameters; the figures for these are taken from a series 
of 360 measurements made by radiology (Nicholson, 1938). It might be well to 
add that the radiological method used (Nicholson, 1936) has a probable error of 
accuracy as low as a millimetre. 

These figures are 



Bipariotal 

Conjugate 

Transverse 

Mean, (mm.) 

91-6 

116-4 

132-3 

Standard deviation (mm.) 

4-0 

10-6 

7-0 

Coeflaoient of variation 

4-4 

9-0 

L- 

5-8 


The distribution of these variables is normal, and the two latter aro independent, 
so that we can estimate the following figures for the two areas; 



Soetal head 

(y) 

Maternal pelvi.s 

{X) 

Mean (sq. ora.) 

65-8 

121-0 

Standard deviation (sq. cm.) 

5-S 

12-() 

Coefficient of variation 

8-8 

. 

10-7 


The distribution of these variables is not, theoretically, normal but the error 
from assuming normality will be negligible; we shall also assume that they are 
independent, an assumption which is apparently not unreasonable. We may now 
calculate the following constants for the frequency curve for the ratio : 

a = 12-9; 6 == 5-8; a = tan'i (T/Z) = tan-^ (65-8/121-0) = 28® 32'-25; 
k = 137-703; cr^ = 9-357; h = 14-720. 
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Prom the tables of the normal curve we get the deviate for a frequency of 
1 % as 2-3263, and applying (5«), (10), and (16), we have 

hm^ = 2-3263, 
sin^ = 0-15804, 

^ = 9 " 6 ', 

tan“^{^)tana} == 50“ 26', 
tan^^{(a/6)tan(a+^)} = 69“ 31', 

(a/6)tan(a+^) = 1-69879, 
tan(a+d) = 0-764. 

The required percentage ratio is then 76-4 %. Using the usual approximation 
(T/l){l+(crJI)^}, the mean of the ratio would be 65-5%, and its standard 
deviation ( Y jl) 'I'S %; if wehadassumedthatthedistribution 

of the ratio was normal, we should have got a result of 72-9 % ; so that, even when h 
is quite high, the distribution of the tails of the curve may bo far from normal. 

The value of the figure 76-4 from the point of view of prognosis is tliat we can 
now predict that unless an event has occurred, the chances against which are 
99 to 1, a pelvis with an area of llOsq. cm. can pass 99-9 % of foetal heads, that 
a pelvis of 100 sq. cm, can pass 97 %, that a pelvis of 90 sq. cm. can pass 70 %, but 
that a pelvis of 80 sq. cm, can pass no more than 21 % of foetal heads. 
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THE STATISTICAL SIGNIFICANCE OF 
CANONICAL CORRELATIONS 


By M. S. BARTLETT 


1. In an important paper published in this Jmr-ml, Hotelling (1936) has 
shown that the generalized variance matrix* ^ 


of a vector variate x which has been partitioned into two parts Xj and with, 
say, q and p components, can, by appropriate linear transformations and 
LjXji of Xi and x^, be thrown into the canonical form 




R is a rectangular matrix which is zero except for a leading diagonal of squares 
X\ of canonical correlations, 

Similar operations on the estimated matrix variance V give rise to estimated 
canonical correlations l^, which measure the correlations between estimates of the 
linear functions LjXi and LaXg. While Hotelling has given asymptotic standard 
errors for the coefficients 1^, it is known that the significance of these correlations, 
as in the simple case p = 1, is more generally to be interpreted as the significance 
of the regression relations of Xj with x^', the validity of any exact tests of signi- 
ficance depending on the supposition that the dependent variate Xj, apart from 
its linear dependence on x^, is normal. 

Special oases of the simultaneous distribution of the correlations 1^, when Xj 
and Xi are unrelated, have been considered by Hotelling (1936) and Girsohiok 
(1939), but an important theoretical advance is represented by the derivation of 
the distribution (under the same conditions) for any values of p and q (Eishor, 
1939; Hsu, 1939), It will be shown that this distribution makes available further 
possible tests; and since the problem of the most appropriate teats of significanoe 


* A matrix is usually denoted by a capital tetter, and if it baa both a population and aamplo 
value, the population value is given in heavier type (of. Bartlett, 1939). The transpose of any 
matrix A is denoted by A‘. A matrix with only one column is a vector, and is often denoted by a 
small letter. To avoid confusion, a' vector variate X is written in heavier type throughout, to distin- 
guish it from a single variate x. Ifx is measured from its population mean, the variance matrix V 
is the average value ofxx'. In practice we lose one or more degrees of freedom by measuring x 
from sample or regression means, but without loss of generality we shall suppose that our sample 
consists of measurements ofx with v degrees of freedom. 
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has not always been considered very adequately by other writers, it is also tlie 
purpose of this paper to explain the logical relation of these further teats to tests 
of significance previously available.* 


2. For the case p < the distribution of 1^, when these roots are arranged in 
order of magnitude, is given by 

ll)dlldll...dll, 

where F = 0 fl 

i«l,V I 


and 


G 




rup-i+i) 


( 1 ) 


-i +i)ruv-q-i+i}ri(p-i+i )• 

For p > g, we need only reverse the roles of and Xj. 

A criterion which is useful in detecting the simultaneous departure of several 
roots from zero is the product 


n(i' 

i"i 


■ Zf) = A, say.t 


Whenp = 1, the distribution of A is equivalent to that of If, and the distribution 
in (1) can be transformed if required into Fisher’s z-distribution. When — 2, it 
was found by Wilks that a similar distribution exists for a/A. For p > 2, no exact 
test is at present available, but the formula 

= -{‘'-i{ 3 >+g+l)}log/l, 

with pq degrees of freedom, gives a good approximate tost (Bartlett, 1938). 

If the roots are zero, we are, however, including in A irrelevant 

degrees of freedom which might possibly obscure the significance of A|. For any 
test on A| by itself, we have little choice but to consider If, though we do not really 
know whether if is the root corresponding to A| or not. The probability distribu- 
tiont p(Z§) is theoretically obtainable from (1), and hence also 0-06 or 0-01 levels 


* The distribution of 1,^ obtained by Fisher and Hsu has also been obtained by Roy (1030), 
though this writer was oonoemed with the different problem of comparing the dispersion in two 
multivariate normal samples. For a single variate, testing the signiiioanoe of a sum of squares 
separated off from the total sum of squares by a multiple correlation or regression formula is 
equivalent to testing the ratio of two variances, a criterion also employed to test the equality of 
two population variances, Roy has pxopbaed generalizing the latter problem along lines which give 
rise to the same distribution problem solved by Fisher and Hsu, but while ho has independently 
obtained the same general distribution, the need for some care in the ohoioo of tests in multivariate 
analysis is even more evident in the problem with which Roy was oonoemed. It is obvious, for 
example, that the p roots which Roy considered cannot represent all the possible differences among 
the 4 jP(J)+ 1) variance and covariance parameters between two p-variato normal samples, and some 
explanation of their interpretation seems required. 

t This criterion has been proposed by Wilks (1032), Bartlett (1934), and Hotelling (1936), the 
last-named denoting it by a. 

f The probability of a random variable having a particular value m is denoted by p(*). If the 
•variable has a oontimious range of values, pix) denotes the probability of the variable foiling in the 
interval », *-(- da;. The corresponding notations x ) y andp(a; | y) are used 'when the variable is only 
being considered for a fixed value y of another variable. The probability symbol p is not of Course to 
be confused "with the number p of components in the vector variate Xj. 
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of significance of If for specified values of v, p and q. The tabulation of these levels 
would be useful, but would also be a task of some magnitude, and it is therefore 
worth noting that owing to the problem of identification, the largest root is not 
a sufficient statistic for Af, and p(Z|) has no unique relevance. If we consider, 
instead, the distribution of l\ for given values of . . . Z® , we have .corresponding 
to the probability relation 

the probability density relation 

where the function/^, apart from the constant term/g, is determined at once from 
the function F. 

In the logical situation we are postulating where A|, but not the other roots, is 
different from zero, it is not evident which distribution, p{l\) or j Z|, l\), 
provides the more powerful test, owing to the absence of sufficiency properties, 
and it is of some interest to consider in detail another problem which is trivial in 
itself, but serves to illustrate the principles involved. 


3. Suppose we have a pair of variates and iCj both independently following 
a rectangular distribution 


One variate (unspecified) is then shifted a distance a, so that it follows the 

distribution , , j ^ , 

p{x) — dx, («<*<! + a). 


If Xi and *2 denote the variates in order of magnitude, we shall detect the shift 
a from the larger value, x^, if a is large enough. To compare the value of p(xi} and 
p(Xi I x^}, we note first of all that when a = 0, 

p(xi) = 2xj dxi, {0 4Xi^l) 

/ I . dxi 

p(x^ I < ail < 1 ) 

For the significance level e, p(xi) gives a critical value = .^/(l - e), while p(Xi | Xg) 
gives Xi = 1 - e(l - Xg). If a is different from zero, a peculiar feature (analogous to 
the canonical correlation problem) is that the larger observation x^ may or may 
not be associated with a. For p(Xj) we find 

(2x^--a)dXi, (a<Xi<l) 

dx-j^, (1 Xj^ ^ 1 "4* ct) 

For p{Xi I Xa), we have 

2d!xi 

a + 2(l-X2)’ 
dx^ 

a + 2(l-X2)‘ 


(l<Xi^l + (X) 
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This is provided x^^a; for < a, we have 

p{xx j %) = daii. (a 4 Xi ^ 1 + a) 

Using the terminology of Neyraan and Pearson, we shall denote the power of the 
test derived from x^ hy P; and for that from a^il by P'. Then 

rva-o 

1-P=J {2x^~-a)dx^ 

= l-e-a^(l~e). 

For 1 -P', we have first ofall, for given a:*, an integral 

Cl-s(l-x.) 


•1- 
J a-j 


which gives 


2(1 -e) (l-X;) 


{Xi«X) 

(aJa^a) 


a+2(l-a:j) 

Since | a) is given hy 

^ot-{-2(l — (^^aJ^^l) 

dx 2 , (0 < *2 < a) 

we finally obtain, after averaging over x^, 

1-P' = (l-e)(l-a)- Ja^e. 

Before comparing P with P', we may remember that we do not expect either Xi 
or x-i I *2 to provide the most powerful teat obtainable. Theoretically we can see 
what this test would be by considering the ratio 

p{xi, x^ 1 a)lp{x^, *210) = Xg, 

say, though since the value of is indeterminate unless the true value of a is 
specified, it should be realized that X„ does not provide us with any actual test, 
only with a theoretical upper limit for P or P'. 

The criterion has the distribution 

= 00 1 0 
P(-X«|a) = a a(l-a) 

p{X„\Q) = Q [l-af 2a-a2 

For{l-a)®>e,we shall allow the value = 1 to be significant in e/( 1 - a)** of the 
times that the value 1 occurs ; if ( 1 — a)* < e, we allow = 0 to be significant in the 
fraction e-{l-af 

2a -a® 

of the times that X„ = 0 occurs. The power P" of a test that could be based on X„ 

a+e, [(l-a)*>e]. 
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Comparative values of P, P' and P" are given in Table 1 for e = 0-05 and OTO. 

Table 1 


e a _ 

0 

0-1 

0-2 

0-4 

0-6 

0-8 

0-9 

fp 


0-1476 

0-2449 


0-6348 

0-8298 




0-1453 

0-2410 


0-6200 

0-8260 

0-9263 

IP " 



0-2600 


0-6600 

0-8464 

0-9373 

CP 

BiatixW 

0-1949 

0-2897 

0-4796 

0-6692 

0-8690 



■tBOailM 


0-2820 

0-4680 

0-6680 

0-8620 


IP " 

0-1000 

IHil 

0-3000 

0-6000 

0'7000 

0-8785 

0-9603 


It wUl be seen that p(*i) provides a test in this problem rather more powerful 
than p{x-^ I ajg), but that the latter is quite effective. We cannot of course transfer 
this result to our main problem, but it is clear that | If, • • may justifiably 
be considered, at least until the distribution has been tabulated. 

4. Returning then to this distribution, we may examine one or two special 
cases before formally noting the significance level of in general. It has been 
shown by Fisher and Hsu that for v large, the distribution of Zf, Z|, ..., Z|, tends to 

0{rnl, m|, . . ., w* ) dmf dm| . . . dm^, 

where = ^i^Zf, (? = G' fl Jl ("if — wf)l> 

<=i I i={+i ) 

and 1/G' = n{rK?-i + l)ri(p-i + l)}. (2) 

For the particular case p = 2, q ~ 3, the distribution of ml j ot| is 

(mf — m|) dml, 

which is a function simply of - m|. If alternatively we consider the distribution 
pK), we obtain 

the 0"05 significance level for which is 6-37. From p{ml | m|) this value of 5*37 
corresponds to a level 0-030 if m| = 0, to 0-046 if to| is equal to its expected value 
0-50, and to 0-05 when m| reaches the value 0-63. These results merely illustrate 
how the significance level of mf depends on which distribution is being used. 
For the case p = 3, gf = 4, the significance level for m| can be written 

e-{(.+ l) + ^j, 

where u = ml-ml,v = The level of significance thus depends mainly on 

u, as we should expect, but the effect of v is not negligible. The factor multiplying 
the exponential varies, for example, when « = 4, from 10^ for w = 1 to 13 for 
= 0 . 
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The general expression for the significance level for is 

JW 

f ■” e-V n (wf - m|) dml 

J m,’ ^“8 

or, if we write = | 

J<S 

by 

r,„>(i[p+9+i])-{smi|r,,,^4[p+s-i3)+(s . ? 

/ Vi*** 

/t% 


For the more general case of finite v, we haVe similarly for If, 

(I4)«a-r-i) (1 _ ii)Uv-^-p-i) fi [II _ ID dll 
J U' 0^2 ^ 

f ^ (J2)lto-1>-1) (1 _ ll)W-^~p~i) n [Il-ID dll 

J V ^“2 

or, if JEij,(a, ;3) = (* (1 - dx, 

by 

+ 5 + 1], i[v - - g + 1]) - 1 {^\j[> + q~l],iiv-P'-g + ^) + - 

+ 3 + 1]- K»' -2) - ? + 1]) - ! i (ii> + 2 - 1]. K>^ - - 2 + 1]) + ' • • 

W»3 


The dependence of results (3) and, (4:) not only on v, p and q, but also on the 
particular values of . . . , makes it impracticable to tabulate the 0- 06 or other 
levels of significance; but it is not difficult in any instance to find the exact level 
from (3) or (4), using the published tables of r^{a.) or B^((x., /?}.* 

It must be recognized that if the second root A| is also' different from zero, the 
distribution of II for given l| is quite irrelevant, but except possibly when p is 
rather large, it is probable that two or more non-zero roots would be detected by 
the A criterion, and the testing of AJ alone by means of if (a test which is still not 
completely efficient) would not arise. 


5. Directly we have established the existence of at least one root Af, we may 
always proceed to eliminate this correlation A^ and the corresponding pair of 
canonical variates; and analyse the remainder. The theory of eliminating from 
a set of specffied variates represented, say, by the veOtor variate Xo has been 

* Tahles' of the Incomplete F-fundion, ed. K. Pearson (1922, His Majesty's Stationery Office, 
London); Tables of (he Incamplett Bela-funetion, ed, K. Pearson (1934, Biometrika Office, Univer- 
sity College, London). 
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iudioated by Bartlett (1939).* As a particular case, may be a hypothetical set 
of r canonical variates of x^, and the criterion A{v — r,p~r,q) for the remaining 
variate Xj,.o, in place of the original criterion A{v, p, q) for x^, would test the good- 
ness of fit of the hypothetical vector canonical variate Xq. In the case 9 = 1, we 
have the goodness of fit of a hypothetical discriminant function, the problem of 
which was first raised by Fisher (1938). 

It has, however, also been pointed out (Bartlett, 1938, p. 39) that if the canoni- 
cal vector variate Xq has been estimated from the data, the symmetrical relation 
between x^ and x^ will imply that each has only p — r and q — r independent 
components remaining, the approximation for the criterion 

n {i-iD 

i»r+l 

being - {(v - r) - \[(p -r) + (q-r) + 1]} log A' = - (v - ^(p ■+• gf ■+• 1)} log A ' , 

with [p—f) (q—r) degrees of freedom. It was stressed that this reduction of the 
degrees of freedom essentially depends on the existence of non-zero roots 
Af, ..., A^, so that the vector variate Xo is well-determined, and any effect of 
selection of If, from can be neglected. Under the same conditions, 

we may approximately use the tests known for p = 1 or 2, for the criterion 
A'{v-r,p — f,q~r), whenp-r = 1 or 2. 

6. To demonstrate the reduction in degrees of freedom in the case r = 1, 
consider the case when u is large, and 

- vlog A~>- S X*- 

i«l 

If the determinantal equation for 0^ is of the form 

|A-dV| = 0, 

where V denotes the variance matrix of x^, and A is a matrix of the sums of squares 
and products among the p variates of x^ for that portion of the sample separated 
off in terms of the independent vector variate x^. Without loss of generality, we 
shall suppose that V = 1. 

Regarding the v observations for any variate as a vector with v orthogonal 
components, let us now add to the chance variation of the first variate of X 2 a part 
dependent on each of the q (orthogonal) variates of x^. For each variate of x^, 
the length of the vector representing the first variate of x^ wifi, then receive an 
addition A*, say, (fc = 1 . . . i), which will be of order Partitioning off the first 
variate of Xj, we obtain, as our new equation for 0, 



■f' — 0 

dij 


tty-e 


* See equation (2.8) of the paper cited, and the immediately preceding equation. 
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The summation sign is for the q degrees of freedom of Xj, and ~ where 
are the j) variates of Xj. Solving the equation for the largest root, we have 

d, = i:P+ 22'a:iX +«!! + * 


If we neglect the last term, the sum of the remaining roots becomes 


V P 

i*»X 2 




{Exiin 




which with (2> - 1 ) (? ~ i ) degrees of freedom. 


7. To illustrate the use of this test we may comider the data from Kelley 
quoted by Hotelling (1936), these consisting of correlations among tests in ( I) read- 
ing speed, (2) reading power, (3) arithmetic speed and (4) arithmetic in.)wer, the 
sample being one of 140 seventh-grade school children. Hotelling, investigating 
the relation of arithmetical with reading abilities, found canonical correlations 

h = 0'3945, Zs = 0'0688. 

Since v = 139, p = 2, g = 2, the first correlation gives a contribution to of 

-{l39-K2 + 2+l)}log(l-0-394r)») =» 23-09. 

Similarly the contribution from is 0'64. The analysis is consequently sum- 
maiized as in Table 2. 

Table 2 



n.r. 


n 

3 


■■ 

1 

0-64 

Total 

4 

23-73 


It is evident at once, as Hotelling concluded from other tests, that there ia a 
significant relation between arithmetical and reading abilities, which arises 
entirely from the first canonical correlation. 
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ON THE LIMITING DISTRIBUTION OF THE CANONICAL 

CORRELATIONS 

By P. L. HSU 

1 . The purpose of this paper is to deduce the limiting distribution of Hotel- 
ling’s canonical correlations’" under the most general assumption on the popula- 
tion canonical correlations. The result is stated in the theorem &t the end of this 
paper. 

The method employed here is essentially the same as that by which we derived 
(Hsu, 1940) the limiting distribution of Fisher’s discriminating components. 
In what follows, steps in the derivation are given while strictly rigorous reasoning 
is left out. The latter may be found in the author’s 1940 paper. 

The parent distribution is represented by the density 

const.exp|-i( E S rjhCjjj. (1) 


where 


n n n 

— S ~ S “ L yglUhi’ 

1=,1 tml tail 


.( 2 ) 


By virtue of Hotelling’s reduction, the matrix of variances and covariances is 
taken to be 


*11 




a 


■ip ••• Piq 


*00 ^pi 

7ip 


All ••• Api Til 

Ala ■" Ap<t ^4i 


T<ia 


1 .. 

. 0 p[ ... 

0 , 

• 9 


0 .. 

.10... 

Pp • 

.. 0 


Pi • 

.01... 

0 , 

,. 0 

, -.(3) 

0 . 

.. p; 0 .. 

1 . 

.. 0 


0 . 

..0 0.. 

0 

1 


ical correlations. The sample canonical 


• A' ' I i/ X A. 

correlations, r^, . . r^, are the positive roots of the equationf 


ran .. 

. rajp 

bn 

big 

»’«pi ■ 

.. rapj, 

bpi 

••• ^p« 

bn ■ 

bpi 

rcn 

roig 

big . 

•• bpg 

rcgi 

- rc^g 


= 0 . 


.(4) 


* Hotelling (1036). For further work on the distribution of the canonical correlations, 
see Madow (193&), Girschick (1939) and Hsu (1939). 

t We use r in (4) instead of -r as in Hotelling’s original delinition because it is 
know that the non-vanishing roots of (4) form pairs each of which have the same 
absolute value but opposite signs. 
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We set Px — ... = /O^j = pi, 

Pfti+x — . . . = P/ii+fi, ~ Pit 

P/ii+...+/iv-i+X ~ ~ Pa ~ Pv> 

Pa+X — ••• — Pp ~ 

p^>p^> ...>p^>Q, 

and proceed to find the limiting distribution of as •a-^co. 

2. Lemma 1. We-Tiave the identity 

R S =lS|.|P-OS-iR|. (9) 

This results from! the identity 


p 0 


I 0 


P-OS-iR 0 

R S 


-S-iR I 


O S 


on taking determinants on both sides. 


(5) 

( 6 ) 

(7) 

( 8 ) 


Lemma 2. Let 


% = n+^nu.n, 

II 



= npi+^nv„, 



(10) 

<^ga = n + ^Jnw„^, 


{g =t A), j 


{i,j = 1,... 

,p; g,h- 1, ... 

.S'). 



The distribution of the u‘s, v’s and w’s approach that of ^{p + O') (p + g + 1) normal 
variates whose means are zero and whose second moments are specified in the following 
statements’. ' 

(i) any v or w which has at least one suffix number > p is uncorrelated with all 
the others’, 

(ii) any member of one of the sets (i == 1, 

{i,j = l,...,p’,iSpj)is uncorrelated with all the members of all the other sets ; 

(iii) for i,j = 1, ...,p we have 


= ^{wl) = 2, 

«)=1+P?, 

= 2p?, 

«^(%%) = = 2p'i, 

^ul) = e{v%) = <f{wlf) = 1 (i + j ), 
= p'iPj (i 4= j ), 

= p] (i ). 


( 11 ) 
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This follows immediately from the well-known central limit theorem ,* The 
second moments may easily be computed by virtue of (3). 

CoBOtLARY. Under the assunptim (6) my -u, t) or w which tm at least one suffix 
number >sis mcorrelated mth all the others, and, if = 1 /or i = s + 1 , . . . ,y>. 
This results from (11) on putting joJ = 0 for t = s-t- 1, ...iP. 

3. We may now find the limiting distribution of , r^, the p-s smallest 
correlations. 

We substitute (10) in (4) and then divide each element by n. There results 
the equation 





,^ln 


4n 

%,s+l 

,Jn 

^jn 



fn 

•• 1}^ 

% 


^M+l 






Vi.i 

Ijn 

4n 

■ijn 

'**S+l,£ 

^Pl 

4n 

a/w 


’$ ■ 


■sjn 

'Hm 

^jn 

Pi + ~r 


Vl,l 

/w 

V 


Tm. 

^Jn 


4n 

••• 

4n 

^S+1.8 

/n 

bl 

^»i 

■4"5) 

Wm+1 


^Is+l 


%+l,8+l 

^jn 

^P.8+1 

^jn 

«"8+U 

sjn 

>Jn 


™»+l,S 





,,, ^s+i,a 

/a ^ ,Jn 



rwgi rwg, 

,^n ■" Ifn 



!3i£±1 

<fn 



( 12 ) 


The equation (12) has y—s roots which are o(l) for large n. To evaluate these, we 
substitute n ^ for r in (12), delete the common factor n~^ from the rows 


®-tl>« + 2,...,p,p-t-s-l-l.p+5 + 2, ...,p+q, 


* Cf, CramSr (1937), pp. 113-14. 
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0 ... 

0 

0 

... 0 

p'l 

0 

0 

... 0 

0 ... 

- 0 

0 

... 0 

0 

... />; 

0 

... 0 

0- ... 

0 


... 0 

Vl.1 

... '^8+1,8 

^s+l,s+l 

••• 

0 ... 

0 

0 

... -1? 

V 

• • • ^3>a 

^y.8+1 

■ • • l^pq 

p'l 

0 

0 

... 0 

0 

... 0 

0 

... 0 

0 ... 

p3 

0 

... 0 

0 

... 0 

d 

... 0 

^1.8+1 ••• 

^s.a+1 

*^s+l.a+l 

• > • 

0 

... 0 

-P 

... 0 

... 

% 

%+!.<! 

• • • '‘^pg 

0 

... 0 

0 

... -71 


0, 


i.e. 



= 0 . 


.(13) 


By (9) the left-hand side of (13) is equal to 




d. 


'S+I,8+l 


-r 


*s+i,a 


"(bS+l 


... dga-9/2 


where 


dii= S (i,j =5 -1-1 p). 

( 7 = 8+1 


.(14) 

■( 16 ) 


I'®! Ca+n in descending order of magnitude, be the latent roots of the 

matrix II dy ||. Then the p — s roots of (12) which are o(l) for large n are 

n-%i + o{n-^) (i = s+l,...,p). 

If we define ^,+ 1 , ..., by putting 


= (i-=s + \,...,p), (16) 

then the ^’s have the same limiting distribution as the ^"s. Hence the hmi+ing 
distribution of the may be derived as the distribution of the latent roots of 
11 11, in which the d’s are regarded as having a distribution which is the limiting . 

distribution described in Lemma 2. By virtue of the Corollary this is the dis- 
tribution of [p -s)(q- s) mutually independent normal variates with zero mean 
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and unit standard deviation. Therefore* the limiting distribution of the = nr\ 
has the density function 


»-» 
U-1 


ft ft (C("Q 

{b«8+1 


■lyrii]-'! 

/ V \««-p-w I , X A 

X ( n S<l expl-i 2 Cl . 

\i-»+l / \ <»»+l / 


<»>c< 


'«+! = 




The transformation Ci = gives the following density for the limiting dis- 
tribution of the (i =8 4- 1, f)'- 


■■;Vp)=‘ 2P-®-to~«)(«-a)7rKP-») |^n - 8 - i + 1) 

P P \ / P \C— \ 

n n n vi] s vu> a®) 


4. We now proceed to find the limiting distribution of By virtue 

of (9) we may write the left-hand side of (4) as 


li *11 . • ■ ttlj) 

11 

^11 ••• 

, G = 

^11 

1 

• 


1 ®pp 


bpi • . . 


% °(?e 


where 

A = 


Hence, if we set = rf {i=l, 

the 6^ are the roots of the equation 

IBC-iB'-^A] = 0. 

Substituting (10) in (21) and dividing each element by n, we get 

l(A-h«-‘V)(I-l-u-iW)-MA'+»“*V')~0(I+»-^*U) ! « 0, 

where 

U 


•• 

• 

V 


... 


w = 



, V — 




1 Wpi • 

• '“pp 



•” ^PS 

1 




A - 

Pi 

... 0 

0 

... 0 




0 

- Pp 

0 

... 0 


Wn .. 



w*. 


.(19) 

>(20) 

.( 21 ) 

.(22) 

.(23) 

..( 24 ) 


♦ Hsu (1939), pp. 266-7. 
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Neglecting higher powers of w~^ we write I — for (I + and carry 

out the matrix multiplication to the term with n~^ as a factor. There results the 
equation 

lAA' + w-‘(VA' + AV'-AWA')-^(I + «-tU)| =0, (26) 

i.e. 

n~Kp[ Wpi + p’p Vip - p'lp'p Wip - Otiip) 
'^'Hp'iVii+PiVi^-p'iPiVhi - <5%2) pi® -(? + «.-M2p' Ugj -P2®«^22- <9^22) • • • 

(P2 'Opi + pp '^ip - P'iP'p ^2P - ^'^ap) 


'»'~-(p'iVpi+PpVi^j-p'iP'j,U)ip - dujp) n-Hp'^Vj,g+ppVzp-p^ppW^p - ... 

p'p^-d^ -Pp^vipp - SUpp) 

= 0. (26) 

On account of (6) there are roots of (26) which are p|+o(l) for large n. To 
evaluate these we substitute Pi + n~^pi^ for 6 in (26). Since the first of the 
p'’s are equal to pj, there will be a common factor n~^ in each of the first pj rows. 
After deleting this factor and then letting n->-oo, we get the equation 

^^n-Pii'^ii + ^n)-^ Ui2+%-Pi(«i2+Wi4) ... 

Vii+Vii-Piiun + Wiz) + 




= 0, (27) 

i-e. zii-^ ... Zi,„ 

=0, (28) 

■“ ^ 

where Zfj = Vij + Vji-pi{Uij+Wi^) {i, j = l,. ..,p^) (29) 


Let be the roots, in descending order of magnitude, of (28). Then the 

Pi roots of (26) which are pf + o(l) for large n are 

pl+n~^Pi^'i + o(n-i) (i = l,...,pi). 

If we define Ci, ..., hy putting 

0i=- p\ + n-ipiCi (i = l,...,Pi), (30) 

then the ^’s have the same limiting distribution as the ^"s. Hence the limiting 
distribution of the may be derived as the distribution of the latent roots of 
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the matrix II ||, in which the u\ v’s and w’s are regarded as having a distribution 

which is the limiting distribution described in Ijemma 2. 

Now, if all the u's, -u’s and lo’s are normal variates with aero mean, so are the 
z’a. Owing to the fact (ii) of Lemma 2 all the z’s are uncorrelated. Using the 
formulae (11) to calculate the variances of the z*8, wo easily obtain 

^(z|^) = 4(l-pt)^ = 2(1 -/)*)* (i+j). (31) 

(t,i = 

Hence «u = ~ Pi) (1 " P?) (i4=i). (32) 

{i,j = l,...,/ti), 

where the <’s are mutually independent normal variates with zero mean and unit 
standard deviation. 

Setting ^ = 2(1 -p^)?; in (28), wnget, by (32), 


ki-'f) 





^-%i 


1. 


(33) 




| = 0. 



Let be the roots of (33) in descending order of magnitude. 


The density function 

(277)-W/h+» exp ( - i(«fi + . . . + + . . . + (34) 

is equal to (27r)"bn(/‘i+i)exp| — ^ S > (35) 


which is a function of the latent roots only. Hence* the distribution of the latent 
roots has the density 

= n ( 36 ) 

\i=“i / U=i j \ 'j=i J 

-CO- 

The density function (36) represents the limiting distribution of i;^, where 


6i = pl+2n-^Pi(l~p\)n’i (i = l,...,/q). (37) 

Hence d\ = p^ + n-i(l~pl)7|'^+o[n~i) {i=l, 

Ifwedefinei/i, ...,i?^^by 

ri^ Pi+n-i{l-pl)7ii (i==l,..../4i), (38) 


then the % = «*(1 -Pi)“H^{-Pi) have the same limiting distribution as the-'^j. 
Hence the limiting distribution of has the density /( t/^, ..., 9 ?,,^). 

In exactly the same manner we may prove that for ib = 1, ..., v the 

9/i = -ai(l-/)|)-Un-Pfc) (»=/<i+...+/tfe_i+l,...,/<.i+...+/tU 

* Hsti (1939), p. 266, Theorem 2. 
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have the limiting distributioiwepresented by the density 
vrhere, in general, 

/ m \— 1 (mm ^ \ 

n H exp ( 39 ) 

\i>=l / } \ i^X / 

CX^>X-^^ —00. 

Ihirthermore, the. sets (i?i, ■■■> (’?/»,+. Vs) 

are such that the equations corresponding to (27) for two different sets involve 
only mutually uncorrelated u’s, v’a and w’s owing to (h) of Lemma 2. Therefore 
the Hmiting distribution must be such that these sets are independent of one 
another. Also, recalling (14) and (i) of Lemma 2, it is seen that the limiting 
distribution of ^ 1 , must be such that the sets (t}i, iV/ti+i> ■■■>V/ii+M>)> •••> 

(W...+/< -i+n •■••’/a) ^“4 (Vs+i, ■■■> Vp) are independent of one another. 

In conclusion we sum up the results in the following theorem: 

Theorem. Let the population canonical correlations be p[, ...,p'p, where 

pj= ... = py, 

P/Jl+l = ••• = Pui+iti ~ Pz> 

1+1 *** P^ **"* Pv^ 

Ps+1 — ••• = Pp ~ 0, 

Pi>Pa> ...>Py>0. 

Let the sample canonical correlations be ry, where 

Let % = (i = l,...,3)). 

Then the limiting distribution of 7}y, ...,‘rip is represented by the density function 

fiVv •••.7^)/(7«+i. •■•’Vfii+fii) •••/(9/(j+...+/av-i+l> •“> Vs)fi{Vs+i>'->Vp)> 
where thefunctionsf andfy are given by (39) and (18) respectively. 
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THE APPLICATION OP MAXIMUM LIKELIHOOD TO 
DOSAGE-MORTALITY CURVES 

By P. GARWOOD, PhD. 


1, Ihteodttctioh 

Maky papers have been written on the fitting of dosage-mortality curves; in 
partioular, a paper by Dwin & Cheeseman (1939) summartzies the methods 
which have been adopted hitherto. It is felt, however, that there is some mathe- 
matical interest in the subject -which is worth emphaamtig, 

A typical problem occurs when studying the effect of some drug on a particular 
kind of animal. It is assumed that there is a population of animals, and associated 
with each individual animal is a certain lethal dose of the drug, such that the 
animal would always be killed by a stronger dose and would survive a weaker one. 
There is independent biological evidence for assuming that the logarithms of the 
lethal doses are normally distributed throughout the population, so that if the 
proportion of animals expected to survive a given dose is converted into a probit 
(i.e. an equivalent normal deviated 5), then the above assumption is equivalent 
to stating that the probits are linearly related to the logs of the doses. If the mean 
(or median) log lethal dose is m and the standard deviation is cr, then the linear 
relation between probit and log lethal dose is 


where 


1 ^ 6-a 

cr = - 2 , and m = — ^ . 

P P 


The experimental material consists of k groups, drawn at random from the 
population, of %, Wj, . , . , animals, which are given doses with logs Xi, a:*, , . . , a;*,, 
from which there are Sj, . . . , Sji, survivors, and % - s^, “ Sj. ,v . , % - s* deaths. 

The treatment which has hitherto been applied to data of this kind consists of 
obtaining from tables the probits y^,y ^, ... corresponding to the proportions of 
survivors q^ = plotting the y’s against the corresponding log 

doses a;i,a!j,... and fitting a line to the points, bearing in mind the following 
considerations. 

Since dqjdy = -Z, Z being the ordinate of the normal curve, and as the 
variance of q is, PQ/w, Q ( = 1 — P) being the expected proportion of survivors, it 
follows that the variance of y is PQjnZ^ which in general varies along the line, 
Thus different weights must be used for the various probits in fitting the line. The 
effects of using different methods of calculating the weighting coefficient 
w - nZ^jPQ (reciprocal of the variance) have been compared by Irwin & 
Ch^seman (1939). 
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A further difficulty occurred iu the cases of zero and all survivors, for which 
the corresponding probits are infinite, fisher’s method (Bliss, 1935),- using the 
method of maximum likelihood, overcame this difficulty by replacing the infimte 
probit by a working or fictitious probit in the regression equations. 

Mathematically, Fisher’s exact method of calculation is as follows (see also 
Bliss, 1938 jand Fisher & Yates, 1938). Assume rough values % and in the 
relation Y^a^+b^x. 

For each value of x this formula gives P and Q (the areas of the normal curve 
up to and beyond the probit Y), Z (the ordinate. at F) and w = nZ^jPQ. The 
regression is then found between the variate 

y = Y + tj = ari+bjx + 7j, (1) 

where , - 


and X, giving weights w to the former. The result is a new regression equation 


where 


Y = a^+b^Xf 

8wx(y-y) 
Sw{x -x)^‘ 


and 


ttj = y-b^. 


It is to be noted that this form of the regression equation is more convenient 
for our purposes than the form Y = y-k-b^{x—x). Substituting the values of y 
from ( 1 ), it follows that 

, , Swx{y — y) _ _ , 

^2 = ^1+ s i^ ~x f ’ a2 = ai+9/-a:(6a-6i). 

Hence the changes Sa = a^~a^ and <J 6 = 62 — in the regression coefficients of 
y = r+^ on a: are in fact the regression coefficients of ij on x, and they can be 


regarded as the solutions of the normal equations 

da8w +db8v)X = Sioifj, (2) 

daSwx + 8b8wx^ = Svjoctj. (3) 


The new regression equation Y ~ a^-\-b^x is then made the basis of a similar 
calculation; i.e. the regression is calculated between d^ + b^x + y and x (the values 
of ri will be changed since Q and Z are in general altered), giving another equation 
7 = 03 + 63 a; and so on. The process is continued until no change occurs in the 
coefficients. 

It is possible that some arithmetical labour might be saved by obtaining 
the corrections Sa, Sb to the regression coefficients a, b at each stage, instead of 
the new coefficients a+8a,b + Sb, by calculating the regression between 7 / and x, 
but this has not been investigated. The process of obtaining the corrections is 
illustrated later (Table III). 
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Matdmum likelihood mid dosage- mortality curves 

In general the suocessive coefficients and will converge to limite (and the 
successive corrections Sa, Sb to zero), and it is not difficult to mw that these limita 
are the solutions of the maximum likelihood equations. The prcajess is, in fact, the 
same as the general method suggested by Fisher for solving maximum likeliliood 
equations, as will be shown later. Also, a consideration of the foundations of this 
method shows that there is a method of obtaining the maximum likelihood 
estimates which is slightly more rapid (as regards numbers of apjiroxiinations) 
than that outlined above. 

It is one of the objecte of this note to point out that the problem may be 
regarded as one of estimating the parameters m and cr (or equally, a. and /)), 
i.e. of fitting a normal curve to the data. The fact that this is equivalent 
to fitting a line to the theoretical relation between probits and log lethal doses is 
only a consequence of the special nature of the nonnal distributitui; . It may happen 
in other applications that the distribution of the log lethal doses is nob normal and 
cannot be normalized by any transformation of the log lethal doses but has 
another form depending on unknowir parameters; then the problem can only be 
regarded in general as the estimation of parameters from observations. 

In the case of the normal distribution the method of plotting probits is of 
course a very convenient method of representing the data and of obtaining a good 
general picture, but it is to be emphasized that from the theoretical viewjtoint it is 
at least equally important to interpret the problem as one of estimating parameters 
as to regard it as a problem of fitting a regression line in the ordinary sense of the 
term. 


2. Gbkebal maximum likemhood bstimates 


It will be convenient to recapitulate the method used by Fisher for solving 
maximum likeHhood equations (see, e.g., Koshal, 1933). A sample 
is drawn at random from a population of which the frequency function has a 
known form depending on a unknown parameters so that the proba- 

bility of obtaining the sample is 


•^2’ *»> dp Op ..., Og). 

If the variates a: are independent of each other, as in the case of suocessive samples 
from the same population, then P is the product of n probability functions 
OpOp OpOp On the other hand, they will not be inde- 

pendent if they are a set of frequencies with a fixed total. 

The maximum likelihood estimates of the unknown parameters di,0p ... 
based on the information provided by the sample, are the values which satisfy 
the maximum Ukehhood equations 9P/3(?i = 0, dPjdO^ = 0, .... Using the likeli- 
hood function r r/ n o V , ^ 

L — ^*^2* • • • I ^2» * * 0 ^ -P > 

the estimates must be solutions of 9i/3di = 0, etc. 
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Since they are functions of the sample, these solutions can be written 
. . . ), ^ 2 . ■ • ■ . etc- ! if approximations to are found by 

some rough method, suppose that etc., so that 811 , 81 ^, ... are the 

errors in the approximations. Then we have 

etc., ignoring terms in 8 t\, etc. Writing • 

•••) — •f'n 
d^L 

-g^(ii,/a> ■■■) — etc., 

the equations become 8 tj^ + +...=- X^, etc. 

or = (i= (6) 

^=1 

The solutions of these equations can be written 

8 ti = — (i = 1,...,5), 

i=l 

wlbere, in matrix notation, {l^^ = {Xy}~i, the reciprocal matrix of Thus if 
A is the determinant formed by the elements L^j, and the determinant obtained 
by omitting row i and column j, then 

7 ^11 7 “^12 7 7 

rii — ~ 2 ” ! na — ““ 2 ~> ‘13 — “^) •••) Hj — ~ ^ • 

For two variables 

7 — 7 _ 7 _ f'la 7 _ -^11 A — T T Ti 

ril “ ~ ‘■21 “ *'22 “ 2 r ’ ^ ~ “ ^ 12 > 

and 8 ti = ^ 12 -^ 2 ) > ^^2 ~ — (^ 2 if'i”i^ 22 -f' 2 )- 

The corrections 8 t will not be exact, since terms of higher order have been omitted 
from (4) ; however, if the process is repeated with ti + St^, -f St ^, . . . , now used as 
the first approximations, the corrections obtained will be of the next order of 
smallness, and the process can be continued until the approximations t are as 
near as desired to the exact estimates T. 

The coefficients are functions of the observations as well as the approxima-^ 
tions t; it is in some practical cases more convenient to replace the a:’s by their 
expected values, i.e. the values which they would be expected to have if 0^ = 

Biometrika xxxii 4 
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(92 = ij .... Let yly be the resulting values of Ly, so that equations (S) become 


with solutions 


6k 




(i = 1, 


where {Ay} = {/ly}~L 

As before, the corrected values i + 6t can be used as the b^is of the new calculation 
and the process repeated to any desired accuracy. It is to be expected that the 
replacement of Ly by its expected value Ay will result in the approximation being 
less rapid; this is confirmed in particular examples by calculations given below, 
the difference being small, however. 

The method has been used by Koshal (1933, 1939) in fitting a Pearson Type I 


to a set of frequency data. First approximations to the maximum likelihood 
estimates of the four unknown parameters fct, (or /? 2 , 0^, O/j) were found 

by actually calculating a set of values of L and estimating its maximum position. 
The above method was then applied, In this case, if a typical group frequency is n 
and the expected proportion is p, we have, 8 denoting summation over the groups, 


L - constant + ;Snlog 2 ), 


S-fo 

p 


and Lij^~8^PtPj+8^Pf^. 

As before, denotes dpjdd^ andpy denotes 9®p/90{30^. The expected value of the 
last term is 

9 ^^ 

N8pij = = 0 , 

where N = Sn = total frequency, so that 


Ay = 

P 

which were the values used by Koshal. 

The covariance matrix* of the estimates T is approximately equal to (Ay) 
(Fisher, 1922); thus the variance of is approximately - A^i and the covariance 
of and is approximately -- Ajj. The degree of approximation is such that 
terms of a higher order in 1/w are omitted. 

To apply the method to dosage-mortality problems, suppose the probability 
of death is a function P of the dose (or log dose) x and of unknown parameters 

* This has been used by Irwin & Cheeseman (1939) to derive the formulae for the 
variances and covariances of the estimates cf and b of the parameters in the lethal dose 
distribution. 
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01, 02, ... . The combined probability of the given set of survivors is 

s ! {«. — s ) ! 

so that L = const. -|- /S[(n — 5 ) log P + s log Q], 

and = 

where q = sjn — observed proportion of survivors; therefore 

L,, - 

, . „nPiPj 

and Ai^==-S-p^- 

If the distribution of lethal doses (or log lethal doses) depends only on para- 
meters of scale and location, we have 

P = P(a+/?a:) = P(r); 

therefore P^ = P'{(x,+ fix) — Z, and P^ = xZ, 

Where Z is the ordinate of the frequency distribution; thus 

L, = S^{Q-q), L^ = S~{Q-q). 

Putting ■ = C = and ^ = w>, 

we have = S^, — Sx^. 

Also = Lafi = Sxt', Lpp=^Sx%', 

nZ{Q-q)/Z' Z Z\ ,,nZ- 
^ PQ \Z~y^Q) -^PQ 

= -8w + 8n^, ( 6 ) 

1 . Z' z z 

where 

Similarly = — Svix + 8/ix^, (7) 

= - 8wx^+ 8 fix%. (8) 

The expected value of ^ is zero, so that 

^a« = - Sw, A„,J = - Swx, App = - 8wx^. 

The equations for the corrections 8a, 8b to approximations ai,b^ to the 
maximum likelihood estimates, using the “expected” coefficients A, are thus 

8a8w +8b8wx = 8v}r], 

SaSwx + SfiSvxc^ = 8 wx7}, 

i.e. the same as equations (2) and (3). Thus Fisher’s exact maximum likelihood 
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method of correcting the regression line is exactly equivalent, in the case of any 
distribution defined by parameters of scale and position, to the above method of 
calculating successive approximations to the maximum likelihood solutions. In 
the case of the normal distribution, it should ^Te noted that if a and b are the 
maximum hkelihood estimates'of a and /d, then * = (5 — a)/6 and s = 1/6 are the 
maximum likelihood estimates of the mean and standard deviation m and m, 
since it is easy to see that these values satisfy dLjdm = 0, dLjd(r = 0. In other 
words, the problem can equally be regarded as the estimation of the population 
mean and standard deviation from the data. 

Furthermore, it may happen from some pecuUarity of the experimental 
material that each experiment consists of one item, i.e. = ... = 1, and 

the number of survivors is either 0 or 1. For example, the dose might represent 
some quantity which can be measured but not controlled. 

Provided always there is independent evidence on which to base the assump- 
tion of the normal distribution (or of some other known form), there appears to 
be no reason why such data should not be effective for the purpose of drawing 
inferences about the population. 

It is true that for the purpose of testing the hypothesis, say of normality, it 
will be necessary to group the data, and this will also be efficacious in obtaining a 
provisional probit line, i.e. first approximations to a and 6; but for the exact 
estimation of the population parameters this is unnecessary (and would, in fact, 
result in a loss of information), for there is no difficulty in carrying out the 
calculations given above, or illustrated later in Tables II and III, with the values 
of q equal to 0 or 1 . On the other hand, the exact problem cannot be regarded as 
one of fitting a line to the plotted probits, since all the latter are infinite in one 
direction or the other. 

Another convenient way of regarding the problem of finding the maximum 
MkeUhood estimates is a geometrical one. We require to find the values a and 6 of 
a and fi which are such that = dL{a,,fi)lda and = dL{a,fi)ldfi are zero. 
Regarding a, /8, y as cartesian co-ordinates in three-dimensional space, we require 
to find the point P{d, b, 0) where the two surfaces y = y = and the plane 
Y ~ Q meet. This is the same as the point of intersection of the curve = 0 in 
the plane y = 0 (the horizontal plane) and the curve = 0 in the same plane. 
If Pi («!, 6i, 0) is an approximation to P, the tangent plane to the first surface at 
the point vertically above P^ outs the horizontal plane in a line which is near the 
first curve. 

Using co-ordinates Sa,db with reference to as origin, this line has the equation 


dL{a-J)y) 

9a 


-h 


a2P(ai6i) 

da? 


+ Sb 


8^P(ai6i) 

9a9yff 


= 0 , 


or in the simpler notation 


+ daig* -f 6bL^^ ~ 0. 
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Similarly there is a line near the second curve with the equation 

+ 8bL^p — 0 

and the intersection of these lines is a closer approximation to P than P^. The 
values of given in equations (6), (7) and (8) and the effect of replacing 

them by their expected values A^, etc. is to replace the tangent planes by planes 
through the points of contact differing shghtly in direction, and to replace the 
above lines by Mnes whose equations are given by (2) and (3). 


3. The goodness of fit test 


In the regression line treatment of the problem, the test of the hypothesis of 
normality is provided by testing the residual variance about the regression hne, 
with degrees of freedom, two less than the number of groups. Prom the point of 
view we are considering, it would appear more natural to calculate x^ in the 
comparison of observed and expected frequencies, i.e. 




{s - 


nP 


nQ 


„n(Q-q)^ nZ^/Q-qY „ » 

- (-Z-) - • 

with k—2 degrees of freedom. 

The residual variance about the regression line is 

Sw(ij — xda — xSb)^ == Svnj^ — daSwrj — SbSwxTj, 

so that the two values of are identical when 8a = 0, 8b = 0, i.e. when the 
maximum likelihood estimates have been approached sufficiently closely. 


4. Comparison of methods of solving maximum likelihood equations 

8 

For convenience we refer to the method using equations S LijStj = —L^ as 
method I and that using the expected values of the coefficients, viz. 


as method II. Before comparing them arithmetically, it is of interest to enquire 
whether the two are ever identical, i.e. = Aij, for a “scale-location” distribu- 
tion. From (6), (7) and (8) this requires that 

Z' z z ^ 

^~z p^ q 


Now Z = dPjdY = P', so that the probabihty integral P musf satisfy the 
differential equation 


p-« p” 
^ “T'+TTp 


= 0 . 
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An integral of 
is 


P'' + P'^/(P) = 0 

p'eJAPjdP ^ 


so that P' = CP{1 ~ P), 

gS+or 

and hence P = i^gs+cr ‘ 

Since Y = we can choose values B — 0, 0 = 2 for the arbitrary con- 

stants, giving 


and ordinate Z = P' - \ sech*® Y which is the distribution given by Msher &, 
Yates (1938). Thus for this distribution /i = 0 and Lafi = 

hence equations (2) and (3) will give the most rapid solution. 

Tables for facilitating these .calculations have been given by Fisher & Yates 
(1938); it would appear that Tables XII-XIV of this work supply similar tables 
for fitting a distribution of the type P = sin* ^ = sin* (a -f ; here 

fi = -2cot2^, w = 4n, ^ =4«.(Q — o') cosec2^, 
so that = - 4Y- 88n{Q-q) cot 2^ cosec 2^, etc., 

and any advantage in rapidity gained by using method I is almost certainly offset 
by the simplicity in method II, since djja = -4Y,da^ = —iN8x,A^^ - -iNSx^. 
The point has not been tested, since no examples have come to hand in which 
a P = sin* ^ distribution has been envisaged. 

To apply method I to the normal distribution we have ordinate 


therefore 


Z = 

^2 71 

« V ^ 2 

^ = 5-r+^-p. 


The two methods of successive approximation have been applied to each of 
the following three sets of data taken more or less at random from those already 
published for illustrative purposes. 

Example (i). Antipneumococcus serum given to five groups of forty mice; 
Wilson Smith’s data (Irwin & Cheeseman, 1939, p. 179). 


Serum dose o.c. 

• 

oc 

Deaths out of 40 

0'000626 

-2 

33 

0-00126 

-1 

22 

0-0026 

0 

8 

0-006 

1 

6 

0-01 

2 

2 
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Since the increased dose resulted in more survivors, we call q the proportion of 
deaths. 

Example (ii). Mice injected with Bact. Typhi-murium, sample A; Topley’s 
data (Irwin & Cheeseman,, 1939, p. 180). 


Dose (mg.) 

X 

Survivors out of 6 


-3 

4 

0-125 

-2 

3 


-1 

2 

0-6 

0 

0 

1-0 

1 

0 

2-0 

2 

0 

4-0 

3 

0 


Example (iii). Brine shrimps, Artemia salina, in arsenical solutions having 
concentrations in geometrical progression (Fisher & Yates, 1938, p. 6). 


Solution 

a? 

Survivors out of 8 

C 

-3 

8 

D 

-2 

8 

E 

-1 

6 

F 

0 

5 

G 

1 

6 

H 

2 

1 

I 

3 

0 


In each case the scale of x has been chosen with a central origin for convenience. 
The last example was used by Fisher & Yates (1938) to illustrate the use of a 
table (Table XI) drawn up for solving these problems; as, however, the arith- 
metical work in this note has, for purposes of comparison, been taken to more 
places of decimals than practical work demands, this table has not been used, and 
the requisite areas and ordinates of the normal curve have been taken from tables 
of the normal curve (Pearson, 1930). The results of the comparison are shown in 
Table I. 

Most of the values given are probably exact; there may, however, be an occa- 
sional error of one unit in the last place. The first approximation in Example (i) 
is one used by Irwin & Cheeseman; as it has already been calculated from the 
data, the errors in the coefficients are smaller than in Examples (ii) and (iii). 
In Example (ii) the first approximation was found by fitting a line by eye to the 
observed probits, and in Example (iii) the first approximation was found by a 
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Table 1 . Comparison of methods of successive approximation 
to solutions of maximum likelihood equations 



Approximation 

Successive corrections to a 

and b 

Example 



Method II 

Method I 


First 

Final 






Sa 

Sb 

Sa 

Sb 

(i) 

r = 6-6461 + 0 ' 6689 iB 

r= 6 - 6644 + 0 ' 6761 a : 

liil 


■■ 

0-0071 



mn 


mi 

0-0001 

n 

Y = l + x 

r = 6 - 6168 + 0 - 8636 a : 

111 

Wl 

mi 

H|!l 














I 








HU 



0-0002 

WK\\ 

— 

— 

(iii) 

y =: 4 - 8 + 0 ' 6 a ; 

7 = 4-6668 + 0 - 7128 * 

- 0-0269 

0-0891 

- 0-0268 

0-0984 




- 0-0080 

0-0216 

- 0-0072 

0-0142 




- 0-0003 

0-0021 

0-0002 

0-0002 





0-0001 




Method II. Expected values of etc, used. 

Method I. Actual values of etc. used. 


preliminary regression calculation with the coefficients rounded off to one place 
of decimals. The arithmetical details of the calculations (Example (ii)) of the 2nd 
approximation by the two methods are given for illustration in Tables II and III. 

It is seen from a comparison of the results that there is a slight advantage, 
from the point of view of rapidity, in method I. On the other hand, method II 
entails a little less arithmetical work, and if Fisher & Yates’ tables are used it is 
probable that this advantage would be greater, although the point has not been 
investigated. 










Table II. Typical calculation of corrections to estimates of 
parameters, Example {ii), method I . 

lat approximation, y =■ 7 + x 


Dose 

K-) 

X 

n 


7 

7-5 



Z* 

T 

Q~3 

Z 

PQ 




PQ ‘ ^ 



6 

— 1 
4 

0-8 

-L 



mm 


1-8126 

O0749'3 

-0-2374 . 

-0-45638 

0-125 

bI 

6 

3 

0-6 

0 



039894 

BfluJJjM 

1-6958 

-0-16968 




Bl 

5 

2 

0-4 

1 

084134 



-0-24134 

1-8126 


0-2374 



o 

5 

0 

0-0 

2 

0-97725 


005399 

0-02276 

2-4265 





1 

■5 

u 

0-0 

3 

0-99865 




8-2874 


02785 



2 

5 

il 

0-0 

4 

0-99997 






0-2200 



3 



5 

0 

0-0 

5 






■iiiiH 




i^ = |S'?=-0-46228, = 0-63662 

iM = ‘Se' = -l'7629. = 3-1704, = Sx%' = -l-mfi 


da = 


L^L^s-LsLi 




0-8133, Sb = - -0-1952 


a= 7 


6=1 


a+Sa= 6-3867 


6h-^6= 0-8048 


2nd approximation, Y = 6-3867 + 0-8048a:. Pinal approximation, Y = 6-6168-1- 0-8636®. 


Table III. Typical calcuMion of corrections to estimates of 
parameters, Example (ii), method II 


lot approximation, Y =T + x 





B 

D 




Z* 

g 

■ ' z 

ut 

wait 

■Rl 


5 

4 


-1 


0-84134 



0-1708 

BiW 

jRIl 

^HSaKtnH 


5 

3 

o-a 

0 

0-50000 


0-39894 

-0-10000 

-0-2607 



0-25 

-1 

5 

2 

0-4 

1 

0-84134 


0-24197 

-0-24134 

-0-9974 

■ 0-4386 


0-5 

0 

6 

0 

0-0 

2 

0-97725 

0-02275 

0-05399 

0-02275 

0-4214 

0-1311 

0-0000 

1-0 

1 

5 

0 

0-0 

3 

0-99865 

0-00135 

0-00443 

0-00135 

0-3048 

0-0146 

0-0146 

2-0 

2 

5 

0 

0-0 

4 

0-99997 

0-00003 

0-00013 

0-00003 

0-2369 

0-0006 

0-0012 

4-0 

, 3 

5 

0 

0-0 

5 

1-00000 

0-00000 

0-00000 

0-00000 

0-1928 

0-0000 

0-0000 


Swrj = 0-6366 
—xSwtj = -0-8387 
Swtjix-x) = -0-3021 


Swijix-x) 

8v){x-ieY 


-0-2034 


6 = 1 

b + db= 0-7966 


(Ste’ = 6-9494 Sw = 1-6601 Swx = - 3-0118 
—SSvxc = 6-4640 £ = - 1-8142 

Sw{x~x)^ = 1-4854 Swi] = —0-4623 

^ = -0-2786 
~ccSh = -0-3690 
(5o = -0-6476 
0=7 

o-|-(5o= 6-3626 


2nd approximation, Y = 6-3626-1-0-7966®. Pinal approximation, P = 6-6168 -1-0-8636®. 

♦ Five significant figures were used for P, Q and Z, where possible, but only five places 
of decimals are shown in the table; similarly in Table III. 

t As M = 6 in each sample, it has been omitted for convenience from and from 
w, mi in Table III. 
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6. Summary 

The usual practical maximum likelihood treatment of dosage-mortality 
problems (consisting of the transformation, of percentage surviving into probits, 
adjustment and 'sveighting of the latter, and calculation of successive regression 
lines) is shown to be equivalent to calculating successive corrections to the re- 
gression coefficients, The process is exactly equivalent to the method, given else- 
where by Fisher, of obtaining the maximum likelihood estimates of the para- 
meters defining the distribution. A refinement of this method, using the actual 
values of the second derivatives of the likelihood function, instead of the expected 
values, converges a little more rapidly when applied to the normal distribution, 
but this advantage is offset by some extra arithmetical work. The two methods 
are exactly equivalent only for the distribution specified by P = |sech^2. 

The writer is greatly indebted to Mr E. D, van Rest and to Prof. R, A. Fisher 
for much useful help and advice. 
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A NOTE ON FURTHER PJIOPERTIES OF 
STATISTICAL TESTS 

By E. S. PEABSON 

De P. L. Hstt has suggested that I should write a short introductory note on 
the origin of the idea involved in his paper and in that of Er Simaika’s which 
follows.* In searching some twelve years ago for a systematic method of choosing 
the best test of a statistical hypothesis fl"#, Prof. Neyman and I came to the 
conclusion that an essential preliminary to any mathematical formulation of 
the problem was the definition of a set of admissible alternative hypotheses, 
C{H). Starting from this viewpoint, our first method of selecting a test involved 
the use of the likelihood ratio, but, however useful as a practical method of 
attack, the principle underlying this approach was somewhat arbitrary. A more 
fundamental procedure, later developed, was to choose a test paying regard 
to its power function, that is to say, to the chance that its use would lead to the 
rejection of Hq if an alternative oi C(H) were true. It then appeared 

that a number of statistical tests in common use had the remarkable property 
that they maximized this chance for every alternative to in C[H). Such 
tests were termed uniformly most powerful tests of jSo regard to 0{H), 
That there were limitations to the situations in which a uniformly most 
powerful test could exist soon, however, became clear. These limitations were 
gradually explored, and the following papers are further contributions to the 
subject. It was found that these tests generally, though not always, oon- 
berned the value of a single parameter. Such are tests of the hypothesis that 
a mean or a standard deviation has a specified value, or that the difference 
between two means or two standard deviations is zero. Further, in these oases 
the class of alternatives must be restricted; thus the two-sample <-test of the 
hypothesis that two population means and are equal, is only uniformly 
most powerful for the situation in which the alternatives considered are defined 
by > 0 or by — fg < 0 but not for both at the same time. 

In this connexion, in 1936, Kolodziejczyk was able to prove that for tests 
of a linear hypothesis, no uniformly most powerful test could exist if the 
number of parameters involved was greater than tmity. This result was im- 
portant, since the majority of tests used in the analysis of variance can be 
reduced to tests of a linear hypothesis. 

This hmitation of tests regarding the value of two or more parameters can 
be illustrated by a geometric presentation. Since, the most important features 
of the problem can be illustrated when is a simple hypothesis concerning 
the value of two parameters, I shall take this case, using notation already 
adopted in this connexion. 

* See pp. 62-69 and pp. 70-80 below. 
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Further properties of statistical tests 

Suppose that the elementary probability law of random variables 
whose particular values are given by observation, is of form 

di, di being the two population parameters. For a critical region w of size a 
associated with a given test, we may write 

P{J5ew;l(9i,(92} p{E\diyd^)dx^.,.dXn=^ (2) 

If the hypothesis jHq which w has been selected to test assumes that 

d,=ei, ( 3 ) 

then p{6l,(^\w) = a, (4) 

where a is the significance level chosen. 

A power surface may be obtained by taking rectangular axes for 0^ and 0^ 
in a horizontal plane and plotting fi{0x, 0^1 w) as a vertical ordinate. If w„ were 
a critical region associated with a uniformly most powerful test of Hg, then -its 
power surface would faU nowhere below the surfaces derived from other critical 
regions satisfying (4). No unique surface with this property will, however, in 
general exist. If, for instance, we choose Wg so that the surface will rise quickly 
in the direction parallel to the axis of 0^, we shall reduce the rate of increase in 
the direction of 0g, and vice versa. Power surfaces of alternative critical regions 
may, in fact, cross one another in a complicated way, but no single surface can 
everywhere lie above all others. If we confine attention to tests for which the 
power surface has a minimum ordinate of a at the point 0^, 0§, i.e. to unbiased 
tests of Hg, we shall still be unable to find a uniformly most powerful test in 
this restricted field. 

The dilficulty in choice between alternative tests can, indeed, only be solved 
by a further formulation of the requirements of a satisfactory test. Several 
lines of attack are open : 

(i) To lay down conditions for the form of the power surface in the neigh- 
bourhood of the point 0?, 0\. Here we may describe the objective as to make as 
large as possible the chance of detecting small departures in 0i and 0g from the 
values specified hj Hg. A method of approaching the problem from this point of 
view leads to the development of the unbiased test of Type 0 (Neyman & 
Pearson, 1938). 

(ii) To regard it as of more importance to control the form of the power 
surface at some distance from its minimum point; for example, to try to select 
a critical region for which the power surface reaches the level 

J3[6x,6g\w) = 0-m, ( 6 ) 

along a contour lying inside the corresponding contour associated with any 
other test. This method of approach has been examined by Dr B. L. Welch, 
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but his results are not yet published. It is possible that methods (i) and (ii) will 
lead to the same result. 

(iii) To consider whether from the practical point of View, if jHq true, 
the importance of the departure of the unknown parameters from dj, d\ can be 
measured by a single parameter, 

A=/(di,d,). (6) 

If this is so, we are in fact defining a system of contours on the dj, 6^ plane 
along any ofte of which we should like the ordinates of the power surface to be 
constant. Such a system would be defined, for instance, by 

(7) 

and if 6^ | w) is to be constant for values of d^, 6^ satisfying (7), the contoxirs 
of the power surface will be circles of radius A. Alternative tests would then be 
confined to those whose power surfaces had circular contours, would be the 
hypothesis that A == 0 and the uniformly most powerful test, if it exists, would 
be that for which fi{X\w) ' (8) 

for A > 0 and aU alternative critical regions w satisfying the conditions stipulated. 

The problem thus presented in the case of a simple hypothesis concerning 
two parameters will arise in similar form when is composite and concerns the 
value of many parameters d^, dj, ..., d^. In a number of multivariate problems 
we have reached a position in which ; 

(а) tests of statistical hypotheses concerning the values of several population 
parameters have been derived, as well as their power functions ; 

(б) these power functions have been shown to depend on the value of a 

single function A = /(d^, d^, . . . , d,) 

of the parameters considered. 

In the following contributions Dr Hsu and Dr Simaika have examined three 
of these tests, that concerned Avith the general linear hypothesis, with Hotel- 
ling’s generalized and with the multiple correlation coefficient. They have 
shown that of tests whose power function depends only on a certain function A 
of the population parameters, the existing tests are the uniformly most powerful. 
It is of course true that in the problems in question no alternative tests are. at 
present available or indeed likely to become so. Nevertheless, I believe that the 
discovery, resulting from Dr Hsu’s initiative, of the relationship between the 
test function and a corresponding comprehensive collective character in the 
population, has taken us a step farther in our miderstanding of the properties 
of statistical tests. Tufther, this relationship between and A, and ’^*, 
D^ and d*, JS® and p® seems to lead us round by another route to the problem of 
Statistical estimation. 
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ANALYSIS OF VARIANCE FROM THE POWER FUNCTION 

STANDPOINT 

By P. L. HSU 


A yresb: study on. the classical analysis of valance testa in the light of the 
N'eyman-Pearson theory was started by Kolodziejczyk (1936), who formulated 
the class of linear hypotheses for which these tests may be employed. As a linear 
hypothesis is defined relative -to the set of admissible hypotheses, the study of 
the .B^-test (by which we denote any test falling under the usual methods of 
analysis of variance) may be made with reference to its power function. P. C. Tang 
(1938) showed how the pQwer function was related to R. A. Fisher’s (O) distribu- 
tion (Fisher, 1928) and so was able to appraise the chance of detecting the falsehood 
of a linear hypothesis using the jB^-test. The great theoretical value of the power 
function lies, however, in its use in comparing the relative merits of alternative 
tests of the same hypothesis. In this paper we shall prove a theorem (p. 63) 
which asserts that out of a certain class of tests the JB^-test is uniformly most 
powerful. 

In his paper Tang has used an orthogonal transformation in the sample space 
which enabled the general hnear hypothesis to be reduced to the following simple 
form: Given the elementary probability law 

-,ym> exp [-^2 1^2 (2/i- , (1) 


where all real values of th , . . ., i/m and all positive values of cr are admissible, the 
hypothesis is that ( ^ m) of the t/’s have the true value 0; 


Vi = i?2 = ... = Vm = 0. 


We call the above hypothesis Hq. 

We shall set n n 

W‘ .s//{ 






(2) 


( 3 ) 


and call Wq 
inequality 


(of size e) the critical region for the rejection of JBq defined by the 

(4) 


where is a constant so determined that the probability that (4) is true, given 
that (2) is true, equals e. 

The power function of Wq as given by Tang can be written 


ft-o Je,‘ 



P. L. Hsu 


63 


where X = (6) 

An outstanding feature of the power function '(5) is that it depends on the single 
parameter A. Our problem is, does there exist another critical region of size e 
whose power function depends on A alone and which is more powerful than Wq for 
certain values of A ? The answer is contained in the foUo’iV’ing theorem and is in 
the negative. 

Theorem. Suppose that the critical region w satisfies the following conditions: 

(а) w is of size e, 

(б) the power function of w depends on the single parameter A. 

Let /?(A) be the power function of w and /?g(A) be the power function (6) of Wq. Then 

(7) 

for all positive values of A. 

Proof. In the place of Zj, ...,Zn substitute spherical co-ordinates, viz. the 
radius vector r = and n - 1 angles, ..., We deduce from (1) that 

Pit/i, r) = piVi, -,ym)P{ym+v 

( 8 ) 

where p{y ^, . . . , = {^{2n) cr }-^ exp | - , (9) 

PiUm+i 2/m) = S {yt-ViA, (10) 

( i-nx+l I 

p(r) = 2"i<”'~*V“"'(T'J«)~^'r’‘''^exp|— (11) 

and p(di, ..., ^„-i) is the well-known product of cosines which involves none of 
the parameters i?i, °'' 

We now make the following successive transformations: 

n, 

r = 8i, s = t-Zyl yi^t% (i = l,...,ni), (12) 

1=1 

and also write (»=!> •••>%)• (13) 

It follows that 

piy ni+l> ■••’Vm) 0 

= iP(2/nx+l.”-.2/m)25(<9l.---.^n-l)i’(%. -".“ni. <). (l^) 

where 

p(%, ...,u„^, t) = (.^2cr)-<"'+"J7r-l’‘(r^»)-ie-'^<«”+'‘i-2)exp^-^j 

/ »i „\l(n-2) /Jt ”1 \ 
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Prom now on we shall write y for the set of variables y^^+i, ‘••,ym dy for 
^^jii+n • s-iid use similar abbreviations 0, u, dd and du. 

Suppose now that the critical region w satisfies the conditions (a) and (6). 
Let r{y,d,u,t) be the characteristic function of w, i.e. r{y,d,u,t) = 1 or 0 ac- 
cording as the sample point falls within w or not. Let W be the sample space. 
Then the power function of m is 


/d(A) = I r{y,0,u,t)p(y,d,u,t)dydddudt, 
jw 


(16) 


whence, 


1 - 2 ) 


{f2 J r(y, 0, u, t) p{y) p{6) 

X exp ^ — 2^) S dydddudt = e, (17) 

/ t \ / • /Jt "i \ 

X exp I exp I ^ 1 dydddudt 

= 7ri»i r{^n) e^p{X) = F{X) = F > say. (18) 
Let Wi be the sample space of 0, u and t, and put 

(^2 cr)"<"+”i) J r{y, 0, u, t) p{0) ii(“+%-2) 

exp I - ( 1 - exp j dddudt 


Then, by (18), 

/* CO 

o J —00' 


f*tX3 I* 00 

••• Hy)p{y)dy = o, 

J — 00 J — 00 
'ni+l> Yl 


-J’(A) = 0(y,7,or). (19) 

(20) 


[ I m 

y„^,(r)exp(^-^ S fi 


xexpj S cCiyAdy, 


ni+1 

ni+l'”%m = 0, (21) 


on writing for (2fr*)^i (i = «! -I- 1, . , . , m). . 

Equation (20) must hold true for aU real values of the a’s. Hence it follows 
from the well-known theorem on Laplace transformation* that 


'/ 1 ™ \ 

9i(y,r,(r)exp|-^_^I^^y?j = 0, 


2cr*- 

t=n,+l 

* Of. Doetsoh. (1937), p. 36, Theorem 1. Though the theorem referred to is stated for the case 
where the number of y’s is one, it may easily be extended to the case of more than one y by 
induction. 
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i.e. (p{y, y, (t).= 0, >vhence, by (19), 

(V2cr)-(»+»i) f r(y,e,u,t)p{d)ti^”^-^'> 

X exp ^ ^ 

In particular, from ^(y, 0, tr) = 0 and (17) we have 
(^2 (t)-<"+"i> f J'(?/, (9, «, «) p(0) <*<“+*‘1'"®) 

Jw'i 

( t \ / «i \i<»i-2) - 

""2^/ ( ^ ~ 7 dddudt = 7r*"iP(i^») e. (23) 

Letting be the sample space of 6 and u, we get respectively from (23) and 
(22) that 

/•oo 

(^2cr)-<»‘+»i)J 

x&x.p^—^^dt^^r{y,6,u,t)p{d)^-^ui^ dddu = n^'^ir{\n)e, (24) 

rco / t \ r /m \W»-2) 

(V 2 tr)-<»+»»i) <K»i+ni- 2 )exp ^ J ^ r(y, 6, u, t) p{d) |^1 - j 

X exp (tl. dOdu = F r? j • (25) 

Hence, on developing the left-hand side of (26) into power series in the y’s, we 
must have 


j: 




( 2<r^) 


dt 


r / ni \i(»-2) / 71, \h 

X J r{y, d, u, t) p{6) 1^1— 2 ulj S YtUij dddu = 0 for odd h,. (26) 

poo 

2“i(w+ni) Qr“K7i+ni+2A) I ^K^+^i~2)+A 

Jo 

{ t \ C / ni \i<«-S)/ Hi \2/t 

x&s.py—~^dt^^r{y,d,u,t)p(d)\l-^ulj ytri^ij dOdu 

= aft(.Sr?)'‘ ■(;i=l,2.3,...). (27) 

Further, equations (24) and (27) may be written as 


j: 


^l(n+%— 2) q;j^p 


( 


; m (i - suif'-^e^u - .] ,« = 0, (38) 


Biometrika xxxii 
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/•co ! t, \ c' / Jii •\Un~^) 

Equations (28), (26) and (29) must hold true for all positive values of tr. 
Hence, by the theorem of Laplace transformation,* the functions within the 
square brackets in (28) and (29) and the inner integral. in ,(26) must vanish 
identically: 


C /Hi \V.n- 2 .) { «» \h 

r{y,6,u,t) p{6) ( 1- il ^ti I ( S yiuA dOdu ~ 0 for odd A, 
Jw, \ i^l I u=i I 

r / "> \i(«-2)/ tl, \27i 

r{y,d,u,t)p{0)li-'Z'i4] (sri% 

Jw, \ i~l / \i=.l / 


(30) 

(31) 




m 


(A=l,2,3,...). (32) 


“ 2'>'r{l(n + nj) + h} 

From (31) and (32) we iiifer that 

r{y,&,u,t)p{d){l-'^uA exp s ri%U(9dw = G S r? . (33) 

Jw, \ t-l / Xi^l / \i”l / 

Now for any given values of y and t the integral r[y, 9, u,t)f{9, %) dOdu 

Jw, 

equals f{d, u) d&du, where is the set of points in the sample space of 9 and u 

' J w» 

for which r{y, 6,u,t) = 1. Hence (30) and (33) are equivalent to 
C / ”> \K»-a) TT^'^'^rdn) 

/* / n, / »1 \ / "> \ 

J “ i?i“V ^ 

The conditions (34) and (36) axe necessary and sufficient that the critical region 
w should have the properties (a) and (b). 

On the other hand, from (12) we have 


i?* = S «?; 

4-1 


hence Wq is the region defined by the inequality 


S «? > El 

* Of. footnote, p. 64. 


(36) 

(37) 
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Since is of size e, we must get the same right-hand side of (34) when in the left- 
hand side we substitute for w^. Hence 

/* ! \i(n-2) r / ih \Hn-H) 

p(d) 1 - L dddu = p{9) 1 - S dddu. (38) 

J w, \ i=l J JiOo \ i=l / 

Let J j>( 0 )( 1 - exp( 

With the help of (37) and (38) we may now appeal to the lemma proved in the 
Appendix and conclude that 

<?{A)<G'o(A), (40) 

whence, replacing y^ by ar-^^tyi in the integrals in (35) and (39), 

2>(^) (l - .S ^exp i^^yiU^ dddu 

r / »i \i(^— 2) fU ni \ 

< J exp|^^ SyiMijdddw.. (41) 

If we multiply both sides of (41) by 

( “ 2 ^) ■ 

and integrate over the sample space of y and t, we get, in accordance with (18), 

/?(A)<^„(A). 

Hence the theorem is proved. 

APPENDIX 

Lemma. Let ^(a;)>0 be defined for x'^0 and vanish for x>l, suoh that 
g{v\ + ...+v^) is hummable. Let f{wi, be summable. In the product 

space of the v’s and w’s let Rbea region such that 

I f{wx, -.^wj g{v \ ■+■...+ u|) exp (y^Vi + ...-^y^v^)dvdw = (?(y| -h . . . -1- y^). (1) 
Jb 

Let Wq be the region defined by the ineqvnlity 

v\ + (2) 

Let I /(wi, ...,M;Jgf(v? -(-... -t-v®) exp (yiUi+... 4- y„i;JdvdM;= G'o(ri+ ••• +y^). 

V -Ro 

(3)* 

Finally, let 

fiw.^, ...,wjg{vl+ ... + vl)dvdw = f{w.L,...,wJg{vl+ ... + vl)dvdw. ( 4 ) 

JB JB. 


Then 0{x) < ti'o(*) 

for all positive x. 

* Notice that (3) k not a separate condition on Sg, but is implied by (2). 


( 5 ) 


5-a 
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Proof. In (1) we set = a:, = ... = = 0, and get 

0{x^) = f{w^, ...,wjg{vl+ ... + vl)ex-p (xvfjdvdw. 

JR 

This and the conditions on / and g imply that Q is continuous. 

Multiplying both sides of (1) by exp and integrating over the region 

we get 


K!\i^^-^')e-^G{x)dx 
J a 

= f{w^,...,wjg{lvl)dvdw | 
J Jfi 


exp(~£yl+2:yiVi}dy, ( 6 ) 

J a^Syi‘^b 

where Z is some numerical constant. Applying a rotation in the space of the y’s 
to the inner integral in the right-hand side of (6), we obtain 


Ja 


= f f{w^,...,wJg{Ev^i)dvdw\ 
Jr 




exp { - + (Uvj)^ x-j} dx 


~ 1j ca4(-®)> 


where 4(2?) =-i j^/(M;i, 


0»-/ 


a^Sxi‘<,b 


X^QXp{ — Ilx\)dx. 


Similarly, we have 


Sca4{1?o). 

Ja ft=0 


An appeal to. a general lemma of Neyman and Pearson,* on remembering 
(4), leads to the inequality 

4(i?)<4(i?o). 


Hence 


j: 


a;«n- 8 ) e-^I^G(x) — Gg(x)} dx ^ 0 . 


Since a and b are arbitrary and since the integrand is a continuous function, the 
latter must be < 0. Hence G{x) < Gq{x). 


* Neyman and Pearaon (1936), p. 11. 
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ON AN OPTIMUM PROPERTY OF TWO IMPORTANT 
STATISTICAL TESTS 

By J. B. SIMAIKA, Ph.D. 


P, L. Hsu (1940) has shown that for any linear hypothesis the H^-teat is the 
uniformly most powerful of all the tests whose power function depends on a 
certain function, A, of the population parameters. Two other tests of importance, 
namely, those associated with the multipip correlation coefficient and Hotelling’s 
(Hotelling, 1931), have the similar property of being uniformly more powerful 
than all other testa whose power functions depend on the respective functions 
of population parameters involved in the distributions of and It is the 
purpose of this paper to estabhsh such an optimum property of these two tests. 
We shall consider them separately. 


I. Hotbixing’s 

The general problem that calls for the T^-test may be stated in the following 
way: given the elementary probability law 


( a 9 

= cij{, Sy = 8jfi), 


(I) 


it is required to test the hypothesis that 

(i=l,...,g). (2) 

Hotelling’s teat consists in calculating 

= i (3) 

where denotes the general element in the matrix ||5y 11~^, and rejecting the 
hypothesis if (4) 

where Tf is a constant so determined that the risk of rejecting the hypothesis 

when it is true equals e. 

The distribution of derived from (Ij, which conforms with Fisher’s (C) 
distribution (Fisher, 1928), was obtained independently by Hsu (1938) and Bose 
& Boy (1938), and may be written 

p(T*l^,a) =p(T2||^z) 

^ (y2)i9+ft-l(l .p yzj-Km+sWij (5) 




A ! B(^2 + h, ^m) 

a 

i.i=i 


where 


( 6 ) 
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Hence the power function of the T^-test is 

'p{T^\f^)d{T% 


jTe’ 


(7) 


which depends only on the function of the ij’s and a’s. Our first theorem asserts 
that the y^-test is uniformly more powerful than any other test whose power 
function is a function of alone. 

Theoeem I. Let Wq (of size e) be the critical region defined by the inequality (4), 
and w be any other critical region whose size is e and whose power function is a 
function offr^. Let and be the power functions ofw and w^ respectively. 

Thpvt 

fiirxMr)- ( 8 ) 

Proof. Let us first find a necessary and sufficient condition that w should 
have the properties described in Theorem I. We have, from (1), 

p{y,s) = Ke-i'^ \ f ay(Si^ + ?/<?/j)+ S ai^y^yA 


Hence, on setting 
and 


Ui. 


i, J— 1 

+ !.•••> 2). 

a 

(i=l,...,q), 

1-1 


i,l=i 


we have p(y,u) = Ke-i'^\u^^-yiyj\^'^-^&x.pi-\ f a<^%+ S ) 

\ i,l=l i=l ' 


and 


= 4 S oAKdi, 

i.i—l 


where denotes the general element of the matrix || ||“^. 

If w is of size e and has a power function depending only on then 

■^1 -J S a^^u^^dydu == e 

Jw \ i,^=l / 


(9) 

( 10 ) 

( 11 ) 

( 12 ) 

(13) 

(14) 


and 


l««-yi2/3l‘”‘~’-exp(-J S S ^iyi)dydu = e^y{i/r^) = F(i/r^),my. 

J tv \ ifj=l i=l / 

(15) 

It follows from (15) that, on expanding the left-hand side into a power series in 
the ^’s, we must have 


[ I ’u-ij-ViVi [1™“^ exp ( - 1 S ( S dydu = 0 for odd h, (16) 

Jw \ i,j=l / \i=l / 

nL ' l^^^^exp dydu 


K 

{2h)\ 


(A = 1,2,3,...), (17) 


where the a,, are numbers depending only on the region w chosen. 



72 On an optimum property of two important statistical tests 


On the other hand, since the integral of (12) over the sample space W is unity, 
we have 


-^1 l%-2/iyil*"*“^exp(-i S ayMy+- 1: ^tyAdydu = 

Jw \ i=.l / 

(18) 

whence 


(19) 




(A= 1,2,3,...). 

(20) 

Combining equations (14) and (19), (17) and (20), we obtain 


J 

Iw<j-yi2/il*’"“^exp(-i S ayMy)%dM 
w \ i,i=l / 



= e 1 My -yty^l exp ( - S ^ 

(21) 

•J 

1 % “ 1*”*"^ exp 1 ayMyj dydM 

= lw«-2/t2/i |‘”*-^exp^ - ayMyj djydu 



(i^= 1,2,3,...). (22) 


The sample space W is the product space W{u)x.W{y\u), where W{u) is the 
sample space of the u’s and W[y\u) is formed of the possible positions of the 
point (y ^, . . . , t/g) for given values of the it’s. Similarly w = W{u)x w^y \ u) . If we 
evaluate the integrals in (21), (16) and (22) as repeated integrals, we obtain 

f exp(-i f aijuJ 

JwM \ / 

r f I \^”'''^dy-e f | Ui^-y^yj du = 0, (23) 

f exp ( - i i; r f j My - yiyj ( S %1 dv,=- Q for odd h, 

J Jf(u) \ i= 1 / L J to(l/M \1=« 1 / J 

(24) 

I exp ( - 1 S ay My ) 11 1 My - y^y^ jl™-! ( i ^^yJ dy - O;, 

J IFW \ i,3 = l /LJ«>to|M) \i=l / 


M, 


W(y\u) 


'ij-yiVj 


I im-l 


dy1dM = 0 




(A = 1,2,3,...). (25) 


Since equations (23); (24) and (2.5) must hold true for all admissible sets of values 
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of the oc^j, so, according to the lemma proved in the Appendix, the functions 
within the square brackets in these equations must vanish identically. Hence 


I 


m(2/|m) 


- ViVi 1*““^ # = e f I % - ViVi (26) 

J W(y]u) 

f i - Vi Vi I i S % = 0 for odd A, (27 ) , 

f I “ij - ViVj 1*”*"^ ( S ^dy = a,J j ( E £i2/i) % 

J w(vfu) \i=l / J Wivlu) \t=l / 

(A = 1,2,3,...). (28) 

In order to simplify the above equations we notice that the matrix ||%jf|l, 
being positive definite, can be throwm into the form CC', where C is a non- 
singular real matrix. Using the transformation 

II 2/l> • ••) 2/g ll = II *1) •••) *9 II ^ J 


we get that 


(29) 


r / a \iffl-i C / a \lm-i 

l-Ea:?) dx^el l-S*! dx, (30) 

J io(a!|ii) \ <=1 / JlV(»|«)\ i“l / 

r / 3 a \A 

1 1 - S *1 1 I S dx = 0 for odd A, (31) 

J«!(a!|u)\ i=l / \i-l / 

(• / a \im-i / a \2fe 

f.SM d^ 

J uiCaJlu) \ i-1 / \i=l / 

c / a \im-i / a \2A 

= aA 1-Sa:? SM dx (A = 1, 2, 3, ...), (32) 

JW(xlu)\ 1=1 / \i=l / 

where llg„ ...,^,|| = H?,, ...,^9110. (33) 

Now IF (a: I m) is the region 8 (independent of the w’s) defined by the inequality 

(34) 

i=sl 

Hence the integral in the right-hand side of (30) is a numerical constant, say b, 
and a rotation in the space of the a:’8 enables the integral in the right-hand side 
of (32) to be written as 

/a /• / a \im-i 

Hence we obtain the following equivalents of equations (30), (31) and (32): 

r / 3 \4m-l 

dx = be, (36) 

Ji«(q:lw)\ / 

/ a \4m-l / 3 \h 

( 1 — E a:|) IE dx — 0 for odd A, (37) 

J «<al«) \ i=\ j \i=l / 

/• / a \ im -1 1 q. \ih /a \h 

(-LM dx^bA^^l] (A = 1,2,3,...), (38) 

J uKilu) \ i=l I \i=l / U=1 / 

where the are numbers depending only on the choice of the region w. 
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The set of equations (36), (37) and (38) give the necessary and sufficient 
condition that the critical region w should have the properties described in 
Theorem I. Further, equations (37) and (38) may be combined into the 
following: 

C / a \im-i (a \ / 3 \ 

l-s*| exp s » S . (39) 

i=l / \i=>l / \i=l / 

Now according to (29) we have 

S = + n (40) 

1=1 i,i=l 

where denotes the general element of the matrix || ||. Hence is the region 

defined by the inequality 

(41) 

1=1 

Since is of size e, we must have the same equation as (36) when w is replaced 
by Wo therein. Hence 

(• / 9 \im-l Cl 8 \im-l 

l-s*? d,x^\ l-Sa:! dx. (42) 

J«)(x|m)\ 1=1 / J lOoX 1=1 / 

/• / 9 \\m~l I a \ / 9 \ 

Letting (l-S»i exp ( S giarJ da; = t^o ( t h (43) 

Jw,\ 1=1 / \i=i / \i-i / 

we deduce with the help of (41), (42) and the lemma proved by P. L. Hsu in the 
Appendix of his paper (1940) that 

(44) 

Applying the transformation reciprocal to (29) to the integrals in (39) and (43), 
we get 

I |%-2/i2/^|*’""^exp(2:?^2/i)%=5 1 \Uij-yiyj\^"^-'^exp(^^iyAdy. 

\i=I / JwMu) \i=l / 

(46) 

Hence on multiplying both sides of (45) by K exp ( - i/r^) exp I and 

integrating over IF('a) and remembering (16), we have the inequality (8): 


which was to be proved. 
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II. Multiple CoREELATrosr Coeepioient 
In this connection the basic elementaay probability law is taken to be 


•••,^ 9. 3:11,2:12, ••.,2:99) = 


2: Vi 
Vi ^11 


Va 




X- 


19 


2/9 ^91 


X, 


■99 


xexp(-|yz- S (1) 

\ i=i i,j=i / 

where = 7, S fijj (2) 

is the square of the multiple correlation ooefhcient of the population. We have 


2 2/1 

2/1 


Va 


‘-19 


Vq ^qx 


^QQ 


= (3) 


and that the square of the multiple correlation coefficient of the sample is 

jB2 = 1 S 


(4) 


The hypothesis to be tested is that 

^, = 0 (i = l,....g). (6) 

Theorem II. The basic elementary probability law and the hypothesis under test 
being given by (1) and (6), let Wq be the oritical region of size e defined by the inequality 

( 6 ) 

and w be any other critical region whose size is e and whose power function depends 
only on p^. Let fi{p^) and figip'^) be the power functions of w and Wq respectively, 

(7) 

Proof. Suppose that w has the properties described in Theorem II. Then 

\V,n-g-2) 


\i(ji-9-2) 


( 8 ) 


f /a \i(tt-<l!-2) / 2 \ 

4(»-8-2 ) \^- Z_ ViVi^ exp - iyz - i s j dzdydx = e, 

J \^ij ^2 - ^ S yiy)j 

X exp ( - \yz - i s % - S PiVA dzdydx 
\ i=l / 

= (1 = jP(p2), say. (9) 
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Hence, on developing the left-hand side of (9) into a power series in the J3’s, 
we have 

I* / a \Kn-a-2) 

Jw \ / 

X exp( -iya-i S ( S Al/i'l dzdydx = 0 for odd h, (10) 

\ / \i-l / 


_^r 

(2^1) !j„ 




«« l««-9-2) p y^y^ 

X exp - iyz - S dzdydx 


= (A = 1.2,3,...), (11) 

where the ti/, are numbers depending only on the choice of the region w. 

On the other hand, since the integral of (1) over the whole sample space TF 
is unity, we have 

/• / a \ i<n-9-*) 

if J ^ I aiy ViVij 


whence 


X exp ( - iya - 1 i - S /^iVi] dzdydx = {I -p*)-*’‘, (12) 

C /a \«n.-a-a) / a \ 


K 

m 


iL'”" 


*(n-g-2) j 




(13) 


/ a \ / ® 

X exp - Jya - 'L^a-n^’Cn j ) dzdydx 

r{{n+h) 


fe!r(|») 

Combining equations (8) and (13), (11) and (14), we obtain 
r / « \K«-a-a) 


(pV (A = 1,2,3,...). (14) 


(15 


xexp(-^ya-i S (ZijX^Adzd/ydx — e[ {...)dzdydx, 

\ ty-i / Jw 

/ a \ / a \ 2 ft 

X exp ^ - iya~ ^ 

= aftr (...)dadj/da; (A= 1,2,3, ...), (16) 

V ^ 
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where the unwritten integrands in the right-hand sides are the same as those 
in the left-hand sides. 

As before we argue that W = W{z,x)y.W{y\z,x), w = W(z,x)x w{y \ z, x) and 
evaluate the integrals in (15), (10) and (16) as repeated integrals. It follows that 

L i a,,x^\\ (z- i xiiy,yr'"''\y 

JWlx,x) \ ij=l / LJ tt>(u|is,a:) \ vJ=l- / 

r / a , \i(n-Q- 2 ) -I 

-e Iz- ^ x^^yiyA % dzda: = 0, (17) 

JW(V\IS,X)\ i,j=l 7 J 

JW(s,x) \ id=l / 

Q / 3 \««-a-2)/ a \h 1 

iz- S x^^yiyA ( dy \dzdx = 0 for odd h, (18) 

u)(i/|a,a!)\ i.j=l / \t=l / J 

f |a:y|«»-8-®exp(-iyz-i S 

\ i,i=l / 

r /• / 3 \i(n-a- 2 ) / a \2ft 

(z- s SA2/<) dy 

LJ ui(!/|a. a) \ i,j=l 7 \i=l / 

/• / « \i(m-3-2)/ 3 \2n 

-“a 2- S dj/olzda: = 0 (A = 1, 2, 3, ...). 

(19) 

According to the lemma proved in the Appendix, the functions within the 
square brackets in the above equations must vanish identically. Hence 

r / 3 \i(»-3-2) r 

Iz- S x-<'^yiyA dy = e\ {...)dy, (20) 

Jui(3l*,a:)\ i,i=l / J Wiv\e,x) 

r / 3 \4(n--a-2)/ a \A 

2 - s I s dy = 0 for odd ft, (21) 

J«j(i/la.»)\ t,l“l 7 \<“1 / 

/• / a \i(n-a-2)/ 3 \2ft 

2 - s dy 

J 13(1/1*, a;) \ -(,1=1 7 \4=1 7 

= aAf (-Ody (ft =1,2,3,...). (22) 

Jt3(i/|e,a:) 

In order to simplify the above equations we notice that, since the matrix 
11 II is positive definite, it can be thrown into the form CC', where G is a non- 
singular real matrix. Using the transformation 

Ilyi.-.ygiH 2*11*1. •••»*3 11 G'. 


(23) 
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we obtain 

/ a \««-a-a) r , ^ 

1-St!l dt^ei {...)dt, 

(24) 

J at) 

\ 1=1 7 

J W«ka!l 


r i 

a \i(n--g—i) 

/ a \A 

(26) 

I 1^- 

-S<l 1 

S Tik] dt= 0 for odd h, 

i wdb,*) \ 

1=1 / 

\i=i / 


f / a \K»--9--2) / 

t a 

■ ®/t f (h =s 1, 2, 3, . 

..)> (26) 


STiti) dt = 

J at) \ 1= 1 / ' 


JwHiz.x) 


where 





Now W{t\z, a:) is the region (independent of z and the a;’a) defined by the 
inequality 

(27) 

i=i 

Hence the integral on the right-hand sidp of (24) is a numerical constant, say b, 
and a rotation in the space of the i’s enables the integral on the right-hand side 
of (26) to be written as 


/a r / 0 \il(7i-a-2) 

Si i~s«i trdt 

\i=i- / js\ i-l / 


(28) 


Hence we have the following equivalents of (24), (26) and (26): 


f 

J w{t\s,x) 



[ 

II 

1 

r 

1 

6e, 

(29) 


J w(tfz,x) 

V 1-1 / 


r / 

a ' 

iKn-a-z)/ a 



( 

1 - s<? 

I S rik] dt == 

0 for odd h. 

(30) 

J lOtfl*,*) \ 

1=1 / 

' \i=i / 


a ) 


a \2ft. / a 

\h 


1-Si 

1=1 J 

1 ( 

S r^tA dt = bA S' 

t-i / \i=i 

r?j (A = 1,2,3,...), 

(31) 


where the 6^ are numbers depending only on the choice of w. 
Equations (30) and (31) may be combined into the following one: 

r / 9 \l(n-a- 2 ) ' / a \ / a \ 

exp - Sr,<, Si . 

Jw(il2,iB)\ 1=1 / \ 1=1 / \l»,l / 

Now, by (4) and (23), we have' 

1=1 

consequently, the region is defined by the inequality 


(32) 


( 33 ) 
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Since Wq is of size e, we must have the same equa;bion (29) when w{t \ z, x) is 
replaced by Wq therein. Hence 

r I a \}(ri-s- 2 ) Cl 2 

l-Sil dt=\ 1-S<I dt. (35) 

r / 2 \i('n.-!Z-2) / 9 \ / 2 \ 

On setting 1-S^I exp - S = (?o S , (36) 

Ji«, \ i=l / \ i=l / \<=1 / 

we infer, with the help of (34), (35) and the lemma proved by P. L. Hsu in the 
Appendix of his paper (1940), that 

(37) 

Hence, using the transformation reciprocal to (23) to the integrals in (32) and 
(36), we have 

r / a \4(u-a-2) / a \ /• 

(z- I, x^^yiyA expl - S (...)%. (38) 

J \ / \ i=l / Jw»(v|«,ae) 

Multiplying both sides of (38) by 

^( 1 _ p2)i» I I i(,i-a-2) exp I _ lyz - ay ®y j 

and integrating over the space W (z, x) and remembering (9), we obtain 
Therefore Theorem II is proved. 

I am gratefully indebted to Dr P. L. Hsu for putting this problem before me 
and for his helpful suggestions both in the course of my research and in preparing 
this paper for publication. 


APPENDIX 


Lemma. Let E{x) be the set of points {x^nx^^^ x^fj for which the symmetric 

matrix ||a:y || is positive definite. Then 

f I a;y 5 i(a;) I exp ( - S aJy ) da: < C30 (i,j = 1, ...,g) (1) 

jE(a!) \ i=l / 

and 

I 5i(a;)exp ( - S dx = 0 throughout E{a) (ay = x^^, ay = a^) (2) 

jE(a) \ i=l / 


imply that 


f>[x) = 0 almost everywhere in E{x). 



Pmj, Suppose that both (1) and (2) are true. Since the matrix, 
where 4 - 1 and - 0 (i4=j), - Gji, is positive definite for all sufficiently 

small real d’si 80 , by (2), 


E(®) 


-S% exp -i:V</Ua: = 0 






( 4 ) 


for all sufficiently small real d’s. By (1) the left-hand side of (4) is an analytic 
function of each of the ^’s in the neighbourhood of the imaginary axis. By 
analytic continuation (4) must remain true for all complex ffs with sufficiently 
small real parts. In particular, 


lE{x) 


M exp ( - S % exp l*pi S J dx ^ 0 (5) 


\ i=‘i 




for all real values of h. Hence, by the well-known property of the Fourier 
transform, 

^(x)exp I - S Xifj « 0 almost everywhere in E{x), 
which implies (3). 
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MISCELLANEA 


(i) A recurrence relation for the semi-invariants of 
Pearson curves 

By M. G. KENDALL 


The Pearson curves are defined, by the differential equation 

y(a+«>)dx 

6(, + 6ia! + 6ja:^' 

Multiplying by e*"'(6o + bjjx + b^x^) and integrating over the range of the distribution, we have 


j6*'°y(a + x)dx = j(ba + b^x + b^x^)6‘‘‘dy 

= [(6o + ^'ia: + 62 a:‘')e‘* 2 /]- J*«/da!e““{6i + 2ba® + f(6„ + &ia! + 6ja:“)}. 


At the extremes of the distribution we may suppose the expression in square brackets on 
the right to vanish and hence 


j6‘*j/{a4-6i + 6o^ + (l + 26a + bit)® + i>ata;®}da; = 0. 

The moment generating function of the distribution, Af(f), is given by 

M{t) — je*^ydic, 

and hence = (e*'’xydx, etc. Thus from (1) 
dt J 

d^M dM 

ba^ — i (1 + 262 + ^ {a + bi + bf)t) M = 0, 


•( 1 ) 


.( 2 ) 


a linear differential equation of the second order, which may also be regarded as defining the 
Pearsonian system. 

Incidentally, it would be interesting from the theoretical view-point to consider classes 
of frequency distributions defined by differential equations in their moment or semi -invariant 
generating functions. 

So far as I know there is no solution of (2) in ordinary functions which woxdd permit of 
the explicit expression of the co-oflicionb of P in M{i); but from a consideration of the co- 
efficient of P in (2) we have 

{1 -t- (r-f- 2) ba} +1 + + 1) bi}/t^-t-rbo/t,_i = 0, (3) 


the well-known recurrence relation between the moments of Pearson curves. 

Some simplification of this expression is possible by the choice of a particular origin in 

certain oases. If the roots of t t . t. « « 

bo-t-biaj+bjiB® = 0 


are real, it is possible by a real linear transformation to transform the equation defining the 
Pearson curves to one which does not involve b^. With the origin defined by this transforma- 
tion, we have {l-Kr + 2)b2}^;+i-i-{a-(-(r-bI)bi}/<; = 0, 


giving 


, _ . _ ^ (a-H-bi)(a-fr-lbi)...(q4-bi) ^ 

(l-f7+lb2)(l-frb2)...(l + 2b2)‘ 


(4) 
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Putting K ~ logM in (2), we have for tho semi -invariant generating function 

j + + + ~ 

This is not linear, and it appears therefore that there is no simple recurrence relation among 
the semi-invariants as among the moments. The equation is similar in character to that 
knowni as Riccsti’s and the usual way of solving it would be to return to the linear equation 
(2) from which it was derived. 

Taking an origin at the mean (aTi = 0) and considering tho co-ofiicient of f in (6), we have 


(>■ 


^f+l , L 1^1 ^r-1 , ^r~3 , , '''r-l , / 1 I nL ^ ^r+1 , k — o 

i 1 -f (r + 2) bj} (Cr+l +’■^>1 «r + rb, ^ ^ j 

+ |’’T^jAr,+iAV„y+... + i’’ j = 0, .... 


.(6) 


with the initial relation A'a = — 6o/(i + 36j). 

Equation ( 8) seems to be as simple a recurrence relation as wo can expect for the expression 
of a semi-invariant in terms of those of lower order. 


(ii) A comparison of annual and biennial inflorescences of 
Daucus carota (wild carrot) 

By william DOWELL EATEN 
Michigan State College 

Introduction 

In 1932 seed.s from Michigan and Indiana were gathered from Daucus carota for the ptirpose 
of studying environmental effects on tho numbers of pedicels and bracts per inflorescence 
from plants grown from Michigan and Indiana soed.s. In 1933 these sood.s were planted in the 
botanical gardens of tho University of Michigan in the green house and later planted outside. 
In 1933, 44 %oftho plants bloomed; in 1934, 17 % of those that did not bloom the first season 
survived tho winter and bloomed. Results of this study were published by the present writer 
(1934) in an article entitled "A statistical study of Daucus carota”, in which tho numbovs 
of pedicels and bracts on annual and biennial inflorescences coming from those seeds were 
compared. At the end of the article K. Poaraon pointed out that since the seeds wore taken 
from many plants in the wild, some of the seeds might have come from flowers blooming the 
first season and others from those blooming tho second season, that one did not know how 
many annual and biennial seeds came from the two states and that the comparisons might 
not he the same if this was considered. 

To overcome this just criticism seeds were taken from one plant near Ann Arbor, 
Michigan in 1936. These were planted in 1937 in the greenhouse at Michigan State College and 
later planted outside in rows 3 ft. apart and 3 ft. apart in the rows. During the latter part of 
the summer, counts wore made on plants blooming the first year of the number of branches, 
the number of inflorescences, and the number of primary pedicels and bract per inflorescence 
on the stem and first eight branches below the stem terminal cluster. During the summer 
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of 1938 similar counts were made on the plants blooming the second year. The object of 
this articli' is to compare the counts pertaining to the annual and biennial inflorcscenoes. 

During the first flowering season 60 % of the plants bloomed. At tho end of this season 
tho plants which did not bloom appeared to bo in goo<l condition for the coining winter. 
In 1938 biennial flowers appeared much oarlior than tho annual flowers in 1937. Coimts of 
the annuals were made in 1937 in August and September; counts of the biennials were made 
in July and tho first part of August. 

The terminal inflorescences on the stem will be designated by T, the first branch terminal 
inflore.scenoe by A , the finst non-terminal inflorescence on the fir.st branch by A j, etc. Branches 
are considered in descending order below the stem terminal. According to these notations, 
Dj represents the third non-teiminal infloreecenc© on the fourth branc.li. Umbels in this 
article will always moan primary umbels and pedicels or ray.s will always mean primary 
pedicels or rays. 

Size op annual and biennial plants 

The following averages pertain to the number of branches and inflorescences (including 
buds) of annual and biennial plants. 



Annuals 

Biennials 

Parts 

(1937). 

(1938) 

Average no. of branches 

16’3 

20'3 

Average no. of inflorescences 

122'7 

282’5 


These averages indicate that the biennial plants were much larger as to number of 
branches and inflorescences than the annual plants. The second year herb.s were considerably 
taller than those blooming the first season. 

In 1938 most of tho branches used in making the counts had four umbels, whose parts 
could be enumerated; in 1937 very few of these had four umbels whicli were mature enough to 
use. Very few of the first branches belonging to 1937 plants produced more than two non- 
terminal umbels; a good percentage of corro.sponding branches of 1938 plants possessed more 
than two. Counts were made on seventy -seven plants during the first .season and on seventy- 
six during the second. In the second summer there were .several plants with more than 600 
inflorescences and one with 796; the largest in 1937 had 183. 

In 1937 there was 37-7 % of the herbs with at least eight branches; in 1938 there was 
72'4 % with at least eight similar branches. In tho first flowering season 4:6'8 % of the plants 
had at least six branches; during the second .season 90-8 % had at least six branches. There 
were 74-0 % of the annual plants witli at least four branches and 97-4 % of the biennials with 
at least four. Those figures show that the biennial plants were more completely filled out than 
the annuals. 


Size of inflorbsoencbs 

Table 1 contains averages pertaining to the number of bracts per umbel on the stem and 
the first three branches . On the average the number of bracts on tho stem and branch terminal 
clusters of biennials are significantly larger than similar annual clusters. The average size 
(in number of bracts) of stem umbels for annuals was 10-9 bracts; that for biennials was 11-9 
bracts. The average number of bracts per branch terminal was less than 10-3 bracts during 
1937 and greater than 11-4 bracts during 1938. These figures and figm-es pertaining to the 
first eight branches indicate that the averages of the number of bracts on the majority of 
the biennial clusters are significantly larger than similar averages with re.spect to annual 
clusters. 
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Table 1. AveTogea and standard deviations pertaining to the number of bracts 
per umbel on the stem and first three branchea 

1937 



T 

A 

A 



B 



B, 

0 

Cl 

— 

Ca 

Ca 

Number 

77 

T1 

55 

18 


76 

69 

49 


68 

62 

39 



Average 



9-4 

8-6 

— 

10'4 

9-6 

9-6 

— 

10-3 





Standard 

deviation 

1-22 

13S 

1‘22 

l-Ol 


1-66 

1-33 

1'30 


142 

1'21 

1'27 



1938 


, 

IN'iunber 

— 

76 

76 

33 

— 

27 

7 

76 


29 

— 

IS 

71 

30 

25 

21 

Average 

11-9 

114 

9-8 



11-6 





10'2 

10-9 

11-2 

Standard 
deviation 
_J 

h35 

1'20 

143 

1-63 

1-77 



1-20 

141 

144 

1-16 

MO 

1’32 

146 

1-23 


Biennial stem terminals had on the average aignifioantly more pedicels than stem 
annuals j these averages are: 

Annuals Biennials 

56‘6 pedicels 67'8 pedicels 

Branch terminals of biennials have on the average significantly more rays than similar ones 
on annuals. Fig. 1 allows the eye to see at once how these averages compare; the heights of 
the bats on the left represent the averages for the annuals. The bars on tlie left are shorter 
in every case. 



Fig. 1. Averages pertaining to number of pedicels per branch terminal 
umbel for annuals and biennials, a, annuals; b, biennials. 


Many of the averages of the numbers of pedicels on non-terminal biennial clusters are 
significantly larger than those belonging to corresponding annual clusters. The above indi- 
cates that biennial inflorescences (in number' of bracts and pedicels) are significantly larger 
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than corresponding annual inflorescences, showing again that Dattcua carota herbs blooming 
the second season are on the average much larger than those blooming the first. 

On examining average number of pedicels per umbel it is found that branch non-terminal 
umbels have on the average a smaller number of pedicels than the corresponding branch 
tenhinals; for example, and A, are significantly less (in number of rays) than A. This was 
tnie for the other branches. The 1938 averages for C, C^, 0, and C, are as follows: 

G Oj 0, Cs 

61 -7 rays 46 -3 rays 46'3 rays 6l'l rays 

Similar figures were found for the other branches. The averages pertaining to pedicels are 
shown in Table 2. 


Table 2. Averages and standard dsimlions pertaining to the number of pedicels 
per umbel gn the stem and first three branches 


1937 


— 

T 

m 



B 

B 

Bi 


B, 

C 

r' 

Ci 

G, 


Number 

77 

77 


IS 

■ 

76 

69 

49 


68 

52 

39 

■■ 

Average 

56'6 

63-2 

44-6 

40-2 


64-3 


46-5 

— 

55-6 

46-8 

46-6 


Standard 

deviation 


10'98 

7-40 

8-96 

1 

11-32 

8-81 

9-37 


12-20 

8-70 

8*18 



1938 


Number 

“ ! 

76 

33 

27 

7 


30 

29 

15 

M 

30 

26 

21 

Average 

64-26 


Itigi'il 

48-07 

48-43 


47-23 

49-46 

47-13 


46-33 

49-32 

6M4 

Standard 

deviation 

13-10 

9-78 

9-98 

6-27 

8-79 


9-93 

10-31 

10-60 

1 

U-62 

10-07 



CoBREIiATION 

The Pearson linear correlation coefficient between the number of bracts and the number 
of rays for various umbels for annual and biennial umbels are as follows: 


Umbel . , , 

T 

A 

B 

0 

Annuals 


0-410 

0-496 

0-571 

Bieimials 

0-666 

0-612 


0-649 


All of these coefficients are significantly different from zero, showing that there is a definite 
relation between the number of bracts and the number of rays. There are no significant dif- 
ferences between the correlation coefficients pertaining to annuals and biennials except that 
for T which is barely significant at the 5 % level. These values suggest that the size of the 
plant and season do not effect the relation between the number of bracts and rays per umbel. 
Similar figures were found in other investigations of this species (Baten, 1934). The position 
of the umbels on the plant also does not affect the relation between bracts and rays. 
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The coeifloieiits of correlation between the number of bracts on T and on the other 
clusters pertaining to annuals and biennials are about the same and arc significant, indicating 
a real association betwaon bracts on stem and branch terminals and similarly for rays. The 
values of arei 



Bracts 

Rays 

Annuals 

Biennials 

Annuals 

Biennials 


0-644 

0-696 

0-849 

0-741 


Size of herb and season have no effect .on the relation between bracts and rays on T and on 
A, B and C. 

The amounts of dependence of the number of bracts on branch non-terminals have on. 
the number of bracts of terminals for the first three branches were obtained by the correlation 
coefficients between these respective numbers. There are no significant differences between 
these ooeffloients, suggesting that the size of plants and seasons do not affect the relation 
between the nmnbor of bracts and rays on branch terminals and branch first non-terminals. 
This also was true for rays. 

The following figures are the coefficients of correlation between tlie number of bracts and 
rays on first and second branch terminal hvflorescences. 



Bracts 

Bays 

Description 










Annuals 

Biennials 

Annuals 

Biennials 

r,(R (interclass) 

0-748 

0-734 

0-916 

|||||||RH||h 

‘‘‘A3 (intraclass) 

0-701 

0-710 


0-786 


These values indicate no significant differences between the correlations pertaining to annual 
and biennial inflorescences. They do suggest a rather high correlation between the number of 
bracts and rays on first and second branch primary umbels, 

The relation between floral parts on B and T and A is manifested by the following 
multiple and partial correlation coefficients. 


Description 

Bracts 

Rays 

Annuals 

Biennials 

Annuals 

Biennials 



0-796 

0-666 

0-683 

0-682 



Again there ate no significant differences between these coefficients indicating that the 
amoimt of relationship remains the same between these floral parts pertaining to annual and 
biennial inflorescences. 
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Suggestions eob eubtheb study 

It might be argued that biennial plants should naturally be larger in every way since 
these plants had a longer time in which to establish themselves than the annuals; that the 
root systerh of the second season plants are much better for supporting the plants than those 
of the first season. This may be true. To overcome this criticism and to make more reliable 
comparisons between annual and biennial plants and inflorescences it might prove of real 
value to secure seeds from one plant as done in this study, save seeds from the annual and 
biennial flowers, and plant these seeds under the same environmental conditions and then 
make comparisons between the counts made in this study. Seeds should be planted in the 
fall and in the spring. Investigations along these lines may produce more interesting results 


SUMMAEY 


This study has shown that: 

1 . The average number of branches on biennial inflorescences of Daucm carofa is larger 
than the average number on annual inflorescences. 

2. The average number of inflorescences on biennials is larger than on annuals. 

3. Tho average number of bracts per biennial clusters is larger than the average on annual 
clusters. 

4. The average number of primary rays on biennial umbels is larger than that on annual 
umbels. 

6. Tho correlation coefficient between bracts and rays is about 0*60 for annual and 
biennial clusters. 

6. The size of plants and seasons (first and second) do not affect the amount of correlation 
between floral parts on stem terminals and branch terminals. 

7. The amount of correlation between certain floral parts on branch terminals and non- 
terminals is about the same for annuals as biennials. 

8. The amount of correlation between bracts and rays pertaining to first and second 
branch primary umbels is about the same for annuals as biennials. 
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THE LAWS OF CHANCE, IN RELATION TO THOUGHT 

AND CONDUCT 

INTRODUCTORY, DEFINITIONS AND FUNDAMENTAL 
CONCEPTIONS 

BEING THE EIRST OE A SERIES OE LECTURES DELIVERED BY 
KARL EEARSON AT GRESHAM COLLEGE IN 1892 

[It is just fifty years since Karl Pearson took up the part-time appointment of Lecturer in 
Geometry at Greaham College in the City of London. This appointment, which he held 
during the years 1891-1, involved the delivery of certain courses of public evening lectures. 
His first course on ‘The scope and concepts of modem science’, oommenoed on 3 March 
1891; much of its material was afterwards published as The Orammar of Science. Later 
series of lectures dealt with ‘The geometry of statistics’ and ‘The laws of chance’. The 
lecture printed below was found among Pearson’s papers ; it was delivered on 1 November 
1892 and was the first of a series devoted to the theory of probability. — Ed.] 

In everyday life we feel, and justifiably feel, irritated with the man who is 
perpetually asking us to define the words we use. We are wont to reply that we 
use our terms in the ‘ ordinary’ or ‘customary’ sense. As a general rule mankind 
understand each other in ordinary intercourse and do not stop to discuss the 
meaning of words. But in important and delicate business or in legal contraefs 
the accurate definition of the words employed becomes of the utmost weight. 
Even more urgent still is clear definition in the matter of scientific investigation. 
It will not do here to appeal to that vague or floating sense of a word, which is 
termed the ‘ ordinary ’ or ‘ customary’ one, for hardly any two persons use the 
same abstract word for precisely the same range of ideas. What is atiU more 
remarkable is the change which the meaning of words undergo in a few genera- 
tions, so that even the language of our grandfathers requires to be read in the 
light of their (and not our) customary use of words. Take words apparently, so 
simple as Nature, Right, Belief, Iiaw, Chance: what a gulf separates the field of 
ideas we associate with these terms in 1892, from that which was their ‘ ordinary ’ 
or ‘ customary ’ value some century ago, i.e. in the days of the French Revolution ! 
Or, again, how different is the modern scientific use of the words ‘natui'e’ and 
‘ law ’ from the sense often to-day put upon them in popular or current language ! 
Indeed I am inclined to think that the irritating person, who insists in everyday 
life on definitions, is after all rather a social blessing than a social nuisance — for 
ip. my experience 90% of the wordy discussions which arise in ordinary life are 
due to the fact that the disputants have not first fixed the sense in which they 
are using some fundamental conception. 

In the present course of lectures, which will deal with the theory of probability , 
with chance, luck and the vexed question of the scientific measurement of belief, 
we shall have to be especially careful that we clearly define and appreciate ora 
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fundamental conceptions. This insistence on definition must be the starting-point 
of any really scientific discussion, and I want to urge you all to start the study 
of this or any other subject by trying to clearly define its scope and terms. You 
must in this respect ‘ list to what the friar preaches and not to what he does for 
according to a sharp-eyed reviewer I have myself been guilty of publishing a 
book, in which no definition of chance itself was given ! I will endeavour to 
supply that omission in to-day’s lecture. But first I want to point out the rela- 
tion between the subject of my present course and the topics of the two earlier 
ones on the Fundamental Concepts of Science and on Statistics. The relationship 
is a very close one indeed, far closer than might be imagined on a cursory 
examination. We shall find that statistics are the practical basis of much, if not 
aU scientific knowledge, while the theory of chance is not only based on past 
st&tistics but enables us to calculate the future from the past, the very essence of 
scientific knowledge. There is a close relation between provable and probable ; the 
analysis of 20,000 tosses of a coin will help us to penetrate into the very laboratory 
of Nature whose complexity presents us with results strikingly akin to those of 
a game of chance; while the record of a month’s roulette playing at Monte Carlo 
can afford us material for discussing the foundations of knowledge. That things 
apparently so diverse should be so closely related may strike some of you as 
paradoxical, and indeed the ground we are to venture upon abounds in diffi- 
culties and pitfalls. It is one where criticism and controversy have been very 
rife, but at the same time have been fruitful of results and have contributed’to 
clearness of thought. We shall find well-marked divergencies of opinion, charac- 
terizing two different schools, which push to extremes in different directions. 
Based on the- researches of Laplace and Queteiet, we find De Morgan, John 
Stuart Mill and Stanley Jevons pushing the possibilities of the theory of probability 
in too wide and unguarded a manner; while in the opposite camp we find George 
Boole and Dr Venn taking a severely critical and in some respects perhaps too 
narrow view of them. As in many other cases the safe roa^ is probably the middle 
road, and this road is that which .1 conceive Prof. Edgeworth of Oxford to have 
pointed out. For those of you who may have time for reading I would strongly 
recommend a comparison of Chaps, x-xn of Stanley Jevons’ Principles of 
Science with Chaps, vi-xi of Dr Venn’s Logic of Chance and Frof. Edgeworth’s 
Philosophy of Chance published in Mind for 1884. I shall refer to the opinions 
of these writers in the course of our work, but you would find the subject of 
chance as treated by them enticing in the extreme, and they will give you far 
more amply than I can do in these lectures the various features of the controversy. 

While dealing with the subject of books I may also refer to : 

Db MoEGtAw; Formal Logic (1857). Here Chaps, ix-xi are closely connected 
with the topics of our. first two lectures. 

De Moboan: An Essay on Probabilities (1838). This is stiU a useful and sug- 
gestive little book, although it requires some mathematical knowledge. 



KIakl Pearson 91 

WHiTWOETflc: Choice and Chance {3rd ed. 1878). An excellent book with which 
to approach the elements of the mathematical theory. 

Wbsteegaabd Hie Qrundzuge der Theorie der Statistik (1890). By far the best 
textbook on the relation of aiatiatioa and. probability for those who read 
German. 

Now I want to restate in the first place some of the conclusions I placed 
before you in hiy first course of Gresham Lectures. I do not want you now — any 
more than I did then — ^to accept those conclusions as your own but rather to 
probe and investigate them for yourselves, and thus ascertain whether they 
form a basis sufficiently sound for the superstructure placed upon them. The 
conclusions to which I want to draw your attention are those concerning the 
material of science, scientific law and cause and effect. In the first place the 
material of science consists of certain groups of sense-impressions, which we 
term phenomena and in which we mark not only a certain permanency but a 
routine. When we find a certain sequence of sense-impressions frequently re- 
peating ifself, we speak of any antecedent sense-impression as a cause, any 
subsequent one as an effect. A, B, G, D, E, F being a succession of sense- 
impressions, which repeats itself, A,B,G,D, E are aH termed causes of the effect 
F. A scientific law or formiila is a statement which enables us to resume or 
describe in brief language a routine sequence — or many such routine sequences-^ 
of causes and effects. As I pointed out to you, a scientific law does not enforce a 
sequence, it merely describes what. takes place. No law of gases causes or enforces 
the boiling of a kettle of water, when placed on thfe fire ; it merely describes how 
it boils. How then do we know that a kettle of water wiU boil, if placed on the 
fire? The answer is a very simple one, our knowledge of what will happen is 
based upon past experience. Statistics of past experience, our own or that of 
other men, are the basis of our knowledge of aU cause and effect, of aU know- 
ledge of phenomena. Here you have the kernel to statistics as the basis of 
knowledge. The statistics are formed in a rough practical manner, but are none 
the less real for all that. Take any sequence of phenomena such as a kettle of 
water boiling which has been. long enough on the fire. We have behind us the 
invariable experience that kettles in like positions do boil, and we say we know 
that this kettle will boil. If it does not we expect that some portion of the 
customary sequence fails, the fire has gone out, there is no water in the kettle, 
or there is somewhere a breach in the ‘group of causes’. But in our statement 
about the kettle there are really two important factors, there are the statistics 
of past experience, and the assumption that these statistics will apply to the 
future. There is no scientificreason why the same groups of causes should always 
be followed by the same effect. Indeed, a distinguished American mathe- 
matician, Mr 0. Pierce, has gone so far as to support the view that the causes, 
A, B, G, D, E, may be followed by F or (?, .indifferently. There is no logical or 
intellectual proof that like causes will be followed by like effects. It is purely a 
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result of experience. Statistics show ua the prevalency of routine in the past, and 
these statistics are the first basis of our knowledge. That what has held in the 
past, will hold for the future, is again a statement for which there is no proof; 
it is the outcome of our experience of what has happened in the pasta, which 
were at an earlier date futures. Hence our inferences with regard to natural 
phenomena are essentially based on statistics — ^namely statistics of what has 
happened in the past, and the experience that within certain ranges the statistics 
of the past repeat themselves in the future. Now I want you to grasp this.point 
very clearly, for we are coming close to the relationship between statistics, 
knowledge, belief and chance. What do we mean when we say that 106 boys are 
born as compared with 100 girls ^ Or when we assert that such will take place 
next year! Why simply this that the statistics of past years for a very great 
number of births give us boys and girls repeatedly in these proportions, and 
further experience — ^in other words statistics again—shows us that such 
statistical ratios do not change suddenly and abruptly, the results calculated for 
a period of four or five years, hold very closely for the following four or five 
years. Or, again, when we say that we hnow that the sun will rise to-morrow — 
we are just as much appealing to past experience of the action of the sun and 
past experience of the occurrence of routine, as when we appeal to the statistical 
appearance of births. The law of gravitation does not enforce the rising of the 
sun, it is merely a scientific description of what we observe in the motion of the 
planets. Suppose the sun had not risen on one well-authenticated occasion in 
our experience, and on one only, we should then be slightly less confident in our 
assertions as to its appearance to-morrow. Our knowledge would then have been 
weakened down into some very strong form of belief. The more frequently the 
sun had omitted to rise, the less strong would be our certainty with regard to its 
conduct to-morrow, until we passed through every shade of belief to disbelief 
itself. Or let us take a more tangible and possible case. A friend is leaving us, 
say in Chancery Lane at 4 o’clock in the afternoon, and we tell him that he will 
find a Hansom oah at the Fleet Street corner. There is no hesitation in our 
assertion. We speak with knowledge, because an invariable experience has shown 
us Hansom cabs at 4 o’clock in Fleet Street. But given the like conditions 
within reach of a suburban cab-stand, and our statement becomes less definite. 
We hesitate to say absolutely that there will be a cab : ‘ You are sure to find a cab ’ , 
‘I believe there will be a cab on the stand’, ‘There is likely to be a cab on the 
stand’, ‘There will possibly be a cab on the stand’, ‘There might perhaps be a 
cab’, ‘I don’t expect there’ll be a cab’, ‘Its very improbable’, ‘You are sure not 
to find a cab ’, etc., etc. In each and every case we go through some rough kind 
of statistics, once we remember to have seen the stand without a cab ; on occa- 
sions few and far between, ‘ perhaps on an average once a month ’, ‘ perhaps once 
a week’, ‘every other day’, ‘more often than not there has been no cab there’. 
Certainty in the case of Fleet Street passes through every phase of belief to dis- 
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belief in tlie case of the suburban cab-stand. If once a month is the very 
maximum of times I have seen an empty cab-stand, my belief that my friend 
will find a cab there to-day is far stronger than if I have seen it vacant once a 
week. A measure of my belief in the occurrence of some event in the future is 
thus based upon my statistical exi3erience of its occurrence or failure in the past. 
When in a wide range of experience there has been no experience of failure, then 
as in the case of the cab in Fleet Street, or in the ease of the sun rising to-morrow 
our belief becomes so strong that we speak of knowing. But aU this knowing 
really amounts to is a very high, or even the highest possible, degree oi probability. 
I know that tbe three angles of a triangle together make two right angles, for this 
lies in my definition of triangle, and belongs to the field of mental conceptions 
and not to physical phenomena. But of the physical universe I can only say I 
believe such and such things will occur, and the degree of my belief is measured 
in a rough approximate way by the statistics of past occurrence and failure. 

We can see this better, I think, by returning to the definite case of the cab 
on the stand. Once a week on the average of a long experience I have seen the 
stand empty. Thus for every six occasions there is a cab, there is one occasion 
that there is not a cab. Had I sought for a cab at the given hour for a loag 
period I should have been successful six times in every seven. We then define 
the ratio of the number of successful instances to the total number of occasions 
as the probability or chance of finding a cab — ^in this case the chance is 6/7. Thus 
the chance of an event is the numerical measure of past experience. It is based 
essentially on statistical information. How wide must be the range of information 
on which the chance is based we will consider later, for it involves very many 
important points. At present we have the following simple rule .' Taking the 
statistics of the occurrence, find the number of favourable instances and divide 
them by the total number of instances and this is the chance of the event. 
Returning to our cab-stand, suppose that only once in four weeks I have seen 
it empty at 4 o’clock on the average, then the chance of the event, finding a cab 
at 4 o’clock is 27/28 — i.e. in the long run 27 favourable instances per 28 
occurrences. 

Now, if I have found only one failure in 28 occurrences my hope of finding 
a cab on a particular occasion — ^my belief in there being a cab — ^will be far 
greater than if I have.found a failure once in 7 occasions. Thus my belief ia in 
some way related to the chance; if I know the chance is greater, my belief is 
greater. Prof. De Morgan has asserted that the proper measme of belief ia 
chance, and according to him my belief in the two oases cited above would be 
as 6/7 to 27 /28, or as 8 to 9. He thus reaches an exact numerical appreciation of 
belief, what might be termed a scientific measurement of belief. 

This view of De Morgan’s has been severely criticized by Dr Venn. He asserts 
that chance is something objective or physical — ^is based on statistics of the 
occurrence of a physical phenomena — ^while belief is something psychical and is 
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largely determined by the emotional and nervous temperament of the individual 
man. In other words it is subjective and not objective. According to Dr Venn 
probability deals with the laws of things, while according to De Morgan prob- 
ability has to do with the laws of our thought about things. 

Now I think we must agree with Dr Venn that it is impossible to set an 
absolute numerical value upon the beliefs of human beings in practical life. No 
one will venture to say that one of his beliefs is exactly nine times as strong as 
another. Perhaps the only practical measure we can form of the strength of 
beliefs is the readiness of men to act upon them, and the impetuous or credulous 
man will risk as much where the chance is small, as the sluggish or sagacious 
man where the chance is large. At an important crisis he will risk finding a cab 
on a chance which would have induced the prudent man to order one beforehand. 
Clearly then as applied to the beliefs of practical men in actual life, Dr Venn is 
right in asserting against De Morgan that we cannot put an exact numerical 
value on belief. On the other hand I think we must question whether chance 
can conveniently be treated as peculiar to things. The means by which statistics- 
are taken in practical life are human and they become subjective and individual 
in the process of taking and applying them. Besides this the statistics on which 
the chance may be reckoned are frequently at the option of the particular indi- 
vidual and the chance at once becomes subjective and peculiar to him. Let me 
point out what I mean. A man is tossing a coin in a railway carriage, a country 
lad in the carriage, judging by his experience of coins in the past, is ready to believe 
that the chance of a head is 1/2, i.e. that once in two occasions in the long run it 
will come down head. A scientific man (also without guile !) who has made 
experiments in tossing coins knows that every coin has a slight bias, and that 
there is in all probability a slight fraction of a per cent more heads or tails in the 
long run in the tossing of this particular coin. A man of the world knows that 
the coin-tosser is a swindler, and judges that his coin is loaded, so that the 
chance that it comes down head is very far from a half. And the coin-tosser 
himself? Well, he has no statistics at all of the conduct of this particular coin — 
it may be true or biased or loaded — but being an adept in tossing he can bring 
it down head or tail as he pleases. What are we to say is the chance that this 
coin will come down head? We have no statistics whatever of what happens 
when swindlers toss coins, which may after all unknown to them be loaded ! 
Are we to say that the chance is an un k nowable quantity, and that we cannot 
make any application of the theory of probability? I am inclined to tbinlc this 
would unduly narrow the field of our science. It seems to me that we can and 
should apply our theory to the chances subjectively estimated of each occupant 
of the railway carriage, These chances are based on the subjective experience of 
each individual with regard to coins under like conditions, and they certainly 
are more concerned with the laws under which people think about things, than 
with the laws of things themselves. If the country lad bets on a head the chance 
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of a head is for him one-half, for the scientific gentleman it must be a shade less 
or a shade more, for the man of the world it is a very small chance indeed, for 
the swindler it may be a certainty, if he means the lad to win on the first occasion, 
in order to excite him to betting heavier amounts. Now the beliefs of these four 
persons clearly differ in strength and their relative proportions are closely re- 
lated to the individual measurement of the chance, to what we may term the 
subjective chance. I am inclined to think with De Morgan that belief varies very 
closely with the subjective chance, but, this subjective chance depends upon the 
statistics of individual experience, and may differ widely from what we may 
term the objective chance, or the chance based upon statistics of the actual event 
in question and independent of the individual calculator. 

Turn (I hope, for the last time) to our cab-stand and the chance of finding a 
cab on it. Accurate statistics may have been taken of the absence of cabs upon 
it for a long period, perhaps, two or three years. For our present purposes these 
may represent the statistics for the calculation of the ‘ objective chance ’. I may 
have observed the cab-stand, not very regularly, for a few months, and my result 
is: empty about once a week at 4 o’clock; a friend knows nothing about this 
particular cab-stand, but has formed statistics of suburban cab-stands in general ; 
while another person without paying special attention to suburban cabs has 
formed pretty precise ideas as to cab-stands in London as a whole. The statistics 
of suburban cab-stands in particular, or of London cab-stands in general may 
be wide and accurate, or may be individual and approximate ; in either case it is 
a subjective act which classes the particular cab-stand under either of these 
headings, the particular chance selected is the result of individual experience or 
subjective choice. If we ask what is the relation between subjective chance and 
objective chance, I think we can safely say, that while the two often differ 
widely, yet the more deep a man’s experience, the more thorough his observation 
and his knowledge of phenomena, the more closely his subjective statistics will 
fit the objective statistics. He will never, perhaps, make the two coincide, but 
in the long run of practical life his mistakes will be few and tend to balance each 
other; his subjective chance will approximate to the objective chance in Dr 
Venn’s sense. He will know what classes of statistics to apply to individual cases 
with the best results, or in ordinary language, ‘he will be a judge of men and 
things ’. If experience of life and acquaintance with fact lead a man’s subjective 
appreciation of chance to approximate to the objective value of chance, may we 
not say that, if belief varies with a man’s subjective view of chance, then 
ultimately it is objective chance which governs belief? 

There is a difficulty here which is I think sometimes overlooked, but which 
seems to me fundamental and we must regard it with a little care. I take a coin 
and I say the chance, when it is tossed, of a head is one-half. Now what exactly 
does this mean? One or other of two things, either: 

(1) I have tossed this same coin 10 or 20,000 times and found practically the 
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same number of heads and tails. Here the subjective and objective chances are 
practically identical. Or : 

(2) I have not tossed this special coin at all, but judge it to be like other 
coins, of which my own rough experience, and that of other men, presents 
practical statistics of the equality in the number of heads and tails obtained in 
a great number of tosses. 

Here the subjective and objective chances may or may not be the same, for 
after all the coin may be loaded or even a double-headed one. But in this case 
also experience of the coin will ultimately bring the subjective and objective 
chances to the same value, be it 1/2 or otherwise. Now let us go a stage further 
and suppose that experience has brought the subjective appreciation of chance 
to its objective value. Would that objective value be a measure of my belief? 
Now there is an assumption here, which I have before referred to, and want you 
now to particularly notice. The chance is really based on past statistics, it is the 
number of successes observed by the total number of trials. 20,000 times the 
coin has been tossed and 10,000 times — ^within a few units, perhaps — heads 

have appeared ; the chance of head is ^ or 1/2. But this is not all we mean 

when we say the chance is a half. We refer to the future as well as to the past, 
and we assume that if we were to throw the coin an indefinite number of further 
times, there would be in the long run as many heads as tails. Here is the 
assumption we make when chance is taken as the basis of belief as to the future. 
In other words the statistics of past experience are assumed to be identical with 
the statistics of what will happen in the future. When I say that the chance of a 
head is one-half, that statement is meaningless, if it be considered as referring 
to a single toss, it refers to what I believe will happen on the average in a very 
great number of future throws — i.e. a practical equality of heads and tails. This 
belief is based on two elements, first, statistics of past experience as to tossing 
and secondly the permanence of statistical ratios. 

This latter is a most important element and one which in reality forms a 
large factor of belief. Let us bring this out more clearly by a comparison of one 
or two cases. Statistics of tosses show a coin to be a true coin, to give in the long 
run head as often as tail — chance of a head therefore 1/2. 

Statistics of a certain country show that of 206 births 106 are boys — chance 
of a boy being born 106/206. My statistical experience of a certain cab-stand, 
shows that on an average there is no cab there once a week at 4 o’clock — chance 
of a cab 6/7. 

Now before I apportion my belief of what will happen in the long run in the 
future in these several cases, I have to consider the permanence of these statistical 
numbers, and the only way I can do this is by examining statistics as to the 
permanence of similar numbers. The factor of my belief depending on the 
permanency of the chances is itself rooted in statistics. How often have I found 
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the change given by statistics to be constant, how often to change ? What indeed 
is the ‘chance’ of a chance changing? 

A coin has given as many heads as tails in the past, why should it not now 
begin to give a vastly greater proportion of heads? The appeal is again to 
experience, and experience tells us that, if a coin be not battered, bent or altered, 
it maintains indefinitely the same chance of a head. 

On the other hand experience tells us that while in vital statistics there is 
scarcely ever an abrupt change, ratios do alter slowly and gradually, the chance 
of a boy being born as calculated from the last few years may hold for the next 
few, but it may vary from decade to decade and century to century. 

Stfil more may the chance of finding a cab oh the stand vary. I may have 
carried out my observations for two or three months, but the completion of a 
new line of railway or a Licensing Act may make a sudden breach of continuity, 
there is much less permanence in statistics of this kind, than in those of coins 
or babies. Clearly the chance determined from past statistics is not the only 
factor in apportioning my convictions as to the future appearance of heads, boy 
babies, and cabs. The chance of the statistics remaining in the future what they 
have been in the past must also be considered and be shown to be the same in all 
the cases where beliefs are compared. 

Thus, I think, we must agree with Dr Venn, although partly on other grounds, 
in recognizing that the chance of an event is not an accurate numerical measme 
of our belief in its occurrence, but on the other hand we may go so far with De 
Morgan as to assert that our belief is strengthened or weakened when the sub- 
jective chance based on our personal knowledge or experience — or on the rough 
and ready statistics of practical life — ^is increased or decreased. 

We may even go a stage further and construct a model universe in the 
following manner ; Suppose a world in which men had such width of experience 
that their subjective appreciation of chance was equal to its objective value, and 
further that in this ideal world statistical ratios retained a permanent value — 
in the manner in which we actually find they do in games of chance — then in 
such a scientific ideal world chance might fairly be considered to measure belief. 

It may be asked what is the use of such an ideal model as this? In the flesh 
and blood men of actual life with their prejudices and half-knowledges the sub- 
jective appreciation of chance diverges often widely from its objective value; 
further in this real world few statistical ratios are actually permanent, they vary 
not only with time, but with the range and limits of our statistics. What then 
can the use of our model be ? Well, of much the same use as the political econo- 
mists’ model of society governed by the laws of exchange or value, or the 
physicists’ molecular model of nature. Neither is true to reahty, but both serve 
with certain reservations to describe in broad terms the general facts of economic 
and physical phenomena. In the same manner, because in a rough and approxi- 
mate way men’s subjective appreciation of chance does tend in the practical 
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experience of life to approach the objective value — and because in a great 
variety of oases chances calculated on past experience are found to remain 
permanent in the immediate future — so men’s beliefs as evidenced especially 
in conduct do vary with chance; and if chance be not a scientific measure of 
belief, it is yet in the average of men a rough and ready means of gauging, more 
or less accurately, the relative strength of convictions. 

It may seem strange to some of you to be told that chance is the measure of 
past experience. At first there may appear to be a very considerable difference 
between the chance that a boy or a girl will be born and the chance that a head 
or tail will turn up. We are quite ready to admit that statistics are needful in 
order to determine whether more boys or girls are born and so to determine the 
chance of a boy or girl birth. But we are not inclined at first to admit an equal 
necessity in the case of the coin. We are inclined to argue that ‘We see no reason 
why head should occur more frequently than tail ’ and then convert this into 
‘There can be no reason why head should occur more frequently than tail’ — 
and then ‘Head and tail must be equally frequent’. You will see the weakness 
of this argument at once by applying it to the case of hoys and girls. ‘We see no 
reason why more boys should be born than girls.’ ‘There can therefore be no 
reason why more boys should he born than girls, ’ and finally ‘ No more boys are 
born than girls’. Here statistics step in and upset all our preconceived notions. 
In fact aU arguments of this kind remind us of the old mediaeval notions of 
physical science, which began by arguing as to what nature ought to do, instead 
of patiently observing what she did do. We may sec no reason why head should 
occur rather than tail. But if there were not a very definite reason why head 
or tail should have the preference in each individual throw, then we may be 
quite sure that the coin would balance on its edge and exhibit neither head 
nor tail. 

Mere inspection of a coin would certainly not suffice to tell ua that the 
‘ chances ’ of head and tail are equal. The head is different in shape and appear- 
ance from the tail, and the coin may really be biased by- this. Let us get over 
this difficulty by taking a perfectly uniform disk absolutely alike on both sides, 
and let us imagine it thrown up so that neither side has any advantage either on 
leaving our hand or on reaching the ground. Can we say that the chances of 
either side are equal without any appeal to experience — to statistics ? The fact 
is that if the two events were absolutely balanced in this manner, if not only we 
saw no reason why one should occur more than the other, but there was no 
reason, then there would be no chance of either event occurring at all. In our 
experience of nature there is no such thing as chance of this kind. The moment 
a coin or a die leaves the hand, its fate is really settled and there is no field for 
the ‘play of chance’ in the obscure sense we have just been referring to. The 
mechanical causes are perfectly definite and the occurrence of head or tail, 
ace or deuce, absolutely certain. It is quite true that these mechanical causes 
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are far too complex, too evenly balanced and too incapable of measurement 
for ns to mechanically describe what must happen, and so predict head or tail. 
Mechanically the one or other is predetermined, but the multiplicity of causes 
varying so slightly and yet so effectively from throw to throw leaves tts in ignor- 
ance as to the result. If we are merely ignorant as to a result which is mechanically 
perfectly certain, what is the meaning of chance in physical nature ? Simply this 
that we aid our ignorance by an appeal to past statistical experience. The chances 
of a coin falling head or tail being equal does not depend on my ignorance of what 
will occur, or on my seemg'no reason why head more thaii tail should occur, but 
on my experience of the statistics of tossing coins. This experience is really summed 
up in the symbolic slang ‘ a toss up ’ — as an expression for an equality of chances. 
I know from my own personal experience and from the common habits of men — 
as well as from the statements of gamblers and others — ^that loaded coins are 
not of frequent occurrence. Without this experience I could predict nothing of 
the tossing of a coin, it might invariably come down 99 % head ; or having fallen 
on the first occasion head or tail, that fall might in itself determine what it 
would do on the second occasion. 

What I have said of tossing a coin holds good for the drawing of black and 
white balls out of a bag. It might seem at first sight that if 60 white and 60 
black balls were put into a bag and well mixed, then, each ball being replaced 
after the drawing, as many white as black balls will be drawn in a large number 
of trials. In other words the chances of drawing white and black balls are equal. 
But here again, if our statement is really to mean anything we must be appealing 
to some rough experience of the conduct of balls in bags. It is conceivable that 
the hand might have a preference for black balls, or that white balls would have 
a preference for each other and the bottom of the bag. If it be objected that the 
hand does not detect colour difierence, and that gravity acts equally on equal 
balls if they be of different colours, we are at once appealing to a wide statistical 
experience resumed in certain fundamental laws of nature. We have left at once 
the shaky ground of subjective reasoning, and turned to statistics. 

But even in these cases direct experiment comes to om? aid and provides the 
statistics, which are only roughly embodied in the everyday experience and 
opinions of mankind. Thus : 

Bufjton tossed a coin 4040 times, there resulted; 1992 heads, 2048 tails, or 
49 % heads, 61 % tads. 

Qubtelet made 4096 drawings out of a bag containing an equal number of 
black and white balls, there resulted; white balls 2066, black balls 2030, or 
60-4% white and 49-6% black. 

Me, Geifeith, one of my students, has kindly tossed a penny 8178 times and 
there resulted; 4092 heads, 4086 tails, or 50-04% and 49-96% tads. 

Wbstbk&aaei) made 10,000 drawings out of a bag containing equal numbers 
of red and white balls well shaken before each drawing after the replacement of 
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the previously drawn ball. He obtained: white balls 5011, red balls 4989, or 
60’11% white and 49*89% red. 

Yotje PEESiNT Lecturer tossed (as a holiday task) a shilling 24,000 times. 
He obtained for the first 12,000 tosses : 6981 heads, 6019 tails, or 49*84 % heads, 
60*16% tails; second 12,000 tosses: 6992 heads, 6008 tails, or 49*933,% heads, 
60*067 % tails. In both series there is a balance in favour of tails: in the first 
of less than 1/6 % ; in the second of about 1/15 %. 

Taking both series together we have: 24,000 tosses: 11,973 heads, 12,027 
tails, or 49*8875% heads and 60*1125% tads. 

To avoid any chance of there being a slight loading in the coin— a very slight 
bias towards tails— let us call the heads, tails and tails, heads in the first 
12,000 tosses, we then find: 12,011 heads and 11,989 tails; or 60*046% heads and 
49*964% tails. 

Thus to 1/20 of a per cent heads and tails are equal, or to express it in another 
manner there has on the average been only one head too many in 1200 tosses, 
Bufion’s experiments coincide with min© in showing a slight bias in favour of 
tail. 

Tinally I have analysed the red and black events in an entire month’s play of 
the roulette tables at Monte Carlo. I find that out of 16,178 throws of the ball* 
8111 fell into a red number and 8067 into a black, or there were 60*14% red 
and 49*86% black. 

These experiments amply confirm the rougher statistical experience of man- 
kind as to the equality of chances in tossing, or drawing balls from bags, or 
playing roulette. It is on experience of this kind, on accurate statistical measure- 
ment, not on a priori reasoning or subjective opinion, that the data of probability 
are to be based. 

In my next lecture I shall deal more at length with the nature of the statistics 
by which we supplement our ignorance of what is about to happen. 


* The twenty-seventh figure, 0, was of course omitted to equalize chances. 
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MEDICAL STATISTICS FROM GRAUNT TO FARR 
By major greenwood 
INTRODTJCTION 

Unbbe the Fitzpatrick Trust, a Fellow of the Royal College of Physicians of 
London is chosen annually by the President and Censors to deliver two lectures 
in the College on ‘ The History of Medicine I had the honour of being chosen 
for this office in 1940 but, for obvious reasons, the lectures were not delivered, 
and it may be safely assumed that some years will pass before a medical audience 
will have time to attend to the history of a subject the modern practice of which 
does not make a strong appeal to physicians. 

The nature of the intended audience inclined me to stress the medical rather 
than the purely statistical aspects of the story and I have trodden ground over 
which a greater man passed some years ago. I hope that Karl Pearson’s studies 
of some or aU of these old heroes will eventually be printed, and I know that my 
slight essays can ill sustain a comparison. But, precisely because they are 
slight and linger over small traits and human oddities, they may, in these times, 
wile away an hour or two. I have eliminated some explanations which no 
statistician or biometrician needs and the medical technicalities are few. Perhaps 
a note on the London College of Physicians as it was in the days to which these 
studies relate should be added. 

The College was more than a century old when John Graunt was born, and 
the corporation consisted wholly of physicians who were Doctors of Medicine of 
Oxford or Cambridge; these were the Fellows. Physicians not Doctors of 
Medicine of Oxford or Cambridge were admissible only to the grade of Licentiate, 
and it was not until the nineteenth century, when Farr was a young man, that 
the exclusive privilege of the senior universities was abolished. It was not until 
Farr was a middle-aged man that the College had any direct contact with general 
practitioners of medicine and began to examine persons who did not seek to 
practise solely as physicians. In modern usage the College licence, L.R.C.P. 
(now only granted jointly with the membership of the Royal College of Surgeons, 
M.R.C.S.), is a diploma obtained by a large proportion of general medical 
practitioners in the South of England. Down to Farr’s time, the L.R.C.P. was a 
‘specialist’ diploma and could not have been taken by a general practitioner 
(the apothecary of those days) at all. The old L.R.C.P. is represented by the 
M.R.C.P. of our own time but with this distinction. Now, Fellows (F.R.C.P.) 
are normally chosen from the body of M.R.C.P.’s. In the past only Doctors of 
Medicine of Oxford or Cambridge could be Fellows, and before election but after 
examination were known as ‘candidates’, not licentiates. The great physician 
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Sydenham was. never more than a licentiate. He graduated M.B. at Oxford and, 
for some unknown reason, never proceeded M.D. until near the end of his life, 
when he took the higher degree not at Oxford but at Cambridge. 

I, THE LIVES OF PETTY AND GRAUNT 

It is always rash to assign an absolute beginning to any form of intellectual 
effort, to say that this or that man was the very first to fashion some organon 
which has proved valuable. All we are justified in saying is that this or that 
ipan’s work can be shown to have so directly influenced the thought of his con- 
temporaries or successors that from his day the method he used has never been 
forgotten. It muy be that fhe lost works of the school of the Empirics Galen 
despised anticipated the numerical method of Louis — some words of Celsus are 
consistent with the hypothesis. It may be that in the long succession of parish 
clerks who for more than a century transcribed the London Bills of Mortality, 
one or two suggested that these figures might have some other use than that of 
warning His Highness of the need to move into Clean Air. But we do not know. 
We do know that out of the casual intercourse of two Englishmen in the seven- 
teenth century was produced a method of scientific investigation which has 
never ceased to be applied and has influenced for good or ill the thought of all 
mankind. In that sense at legist we may fairly hold that John Graunt and 
William Petty were the pioneers not only of medical. statistics and vital statistics 
but of the numerical method as applied to the phenomena of human society. 

John Graunt and WilLiam Petty were both of Hampshire stock. Petty was 
of Hampshire birth, born on Monday, 26 May 1623, and was three years younger 
than John Graunt, who was born at the Seven Stars in Birchin Lane on 24 April 
1620. 

Materials for writing Petty’s fife are abundant ; indeed a good biography of 
him was written nearly fifty years ago by his descendant Lord Edmond Eitz- 
maurice, and since then much of the material used by Lord Edmond has been 
printed. Sources for Graunt’s biography are scanty, the most valuable John 
Aubrey’s brief life of him.’" Graunt and Petty became acquainted in or before 
1660. The circumstances of that first acquaintance are interesting to those who 
meditate upon the perepeteia of human fate. It was the contact of client and 
patron. 

John Graunt’s early life and manhood were those of the Industrious 
Apprentice. His father was a city tradesman, who bred his son to the profession 
of haberdasher of small wares. John ‘rose early in the morning to his study 
before shop-time ’.and learned Latin and French, hut did not neglect his business. 
He was free of the Drapers’ Company and went through the city offices as far as 

* Bri&f Lives, chiefly of Oonteunporaries, set down by John Aubrey, between the years 1669 and 
1696, edited by Andrew Clark, Oxford, 1898, 1, 271 et seq. 
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common councilman ; he was captain and then major of the trained bands (the 
ancestor of the Honourable Artillery Company). At the time of the Great Fire 
he is said to have been an opulent merchant. Even fifteen years earlier he — and 
no doubt his father (1692-1662) — ^had city influence. At that time a Gresham 
professorship was vacant and a young Dr Petty was anxious to obtain it. This 
young man’s career had been unlike that of an industrious apprentice ; it had 
been, even for the seventeenth century, romantic. His father was a clothier in 
Romsey, who ‘did dye his owne cloathes’ in a small way of business. When 
William was a child, ‘his greatest delight was to be looking on the artificers — 
e.g. smyths, the watch-maker, carpenters, joyners etc. — and at twelve years old 
could have workefd at any of these trades. Here he went to schools, and learnt 
by 12 yeares a competent smattering of Latin, and was entred into Greek’ 
(Aubrey, Clark’s edition, 2, 140). 

But the precocious lad did not find a patron in Romsey and was shipped for 
a cabin boy at the age of fourteen. His short sight earned him a taste of the 
rope’s end,' and after rather less than a year at sea he broke his leg and was set 
ashore in Caen to shift for himself. ‘Le petit matelot atoglois qui parle latm et 
greo’ attracted sympathy and obtained instruction in Caen. Caen was not a 
famous seat of learning like Leyden or Montpellier, but the Fellows and 
licentiates of the College of Physicians admitted between 1640 and 1700 include 
the names of four persons who studied or graduated in Caen (Nicholas Lamy, 
Theophilus Garenci^res, John Peaohi and Richard Griffiths). Petty, however, 
was not then thinking of medicine but mathematics and navigation and came 
home to join the navy. In what capacity he served is unknown; he merely says 
(in his Will) that his knowledge of arithmetic, geometry, astronomy conducing 
to navigation, etc., and his having been at the University of Caen, ‘preferred me 
to the King’s Navy where at the age of 20 years, I had gotten up about three 
score pounds, with as much mathematics as any of my age was known to have 
had’. His naval career was short, for in 1643 he was again on the continent. 
Here he wandered in the Netherlands and France and studied medicine or at 
least anatomy. He frequented the company of more eminent refugees, such as 
Pell and Hobbes, as well as that of the French mathematician Mersen. He was 
very poor and told Aubrey that he once lived for a week on three pennyworth of 
walnuts, but on his return to England the three score pounds had increased to 
seventy and he had also educated his brother Anthony. 

At fii’st Petty seems to have tried to make a living out of his father’s business, 
but he soon went to London with a patented manifold letter writer and sundry 
other schemes of an educational character. These occupied him between 1643 
and 1649 and made him acquainted with various men of science, aniong others 
Wallis and Wilkins, but were not remunerative, and in 164‘9 he migrated to 
Oxford. 

Petty was created Doctor of Medicine on 7 March 1649 by virtue of a 
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dispensation from the delegates (no doubt the parliamentary equivalent of the 
Royal Mandate of later and earlier times). He was also made a Fellow of 
Brasenose and had already been appointed deputy to the Professor of Anatomy. 
He was admitted a candidate of the College of Physicians in June 1660 (he was 
not elected a Fellow until 1665 and was admitted on 26 June 1668). At Oxford 
he became something of a popular hero by resuscitating (on 14 December 1661) 
an inefficiently hanged criminal, who, condemned for the murder of an iUegifi- 
mate child, is said to have survived to be the mother of lawfully begotten 
offspring. 

Academically Petty rose to be full Professor of Anatomy and Vice-Principal 
of Brasenose. It is at this point (as usual the precise dates are dubious) that he 
became a candidate for , a Gresham professorship and made contact with John 
Graunt. 

Although, as I have said, the materials for a biography of Petty are abundant, 
all we know of his early years comes from himself or from friends of later life 
who knew no more than he told them. We have no independent means of 
judging the extent of his culture. There is good evidence that he knew more 
Latin than most Fellows of the College of Physicians know now ; none that he 
was an exact scholar (indeed we have his own word, which I am not prepared 
to gainsay,* to the contrary). He was certainly admitted to friendship by some 
men, such as Wallis and PeU, who were serious mathematicians, as by others, 
such as Hobbes, who were not. But whether he could fairly be called a mathe- 
matician is doubtful. Of his medical knowledge we know little.. He left medical 
manuscripts, but these are stiQ unpublished ; of his clinical experience we know 
nothing. 

Petty told Aubrey that ‘he hath read but little, that is to say, not since 
26 aetat., and is of Mr. Hobbes his mind, that had he read much, as some men 
have, he had not known as much as he does, nor should have made such dis- 
coveries and improvements ’. But it is at least certain that he made a favourable 
impression upon men who had read a good deal and that the young Dr Petty of 
1680 was thought a promising man. Still it had been an odd career and one 
wonders what a steady business man in the city of London thought of it. 

Why the anatomy professor who had resuscitated half-hanged Ann Green 
should be made a professor of music is not obvious, and if the Gresham appoint- 
ments were jobs, why should the job be done for Petty ? The modern imaginative 
historian might suggest various reasons. For instance, that Petty made a 

* If Ko. 88 of The, Petty Papers (2, 36) is a typical example of Petty’s Latin Prose style, there 
is not much to be said for it. Here is an example:. ‘An duloius est humanae naturae permultos 
suam potestatem in unum quendam et in perpetuum transferre, id est pendis amittere quam ipso 
puel deindem servare, vel paidatium et in breve tempus irogare, a seipsis demo reformendam et 
disponendam alioquin pro ut, luutato tam rerum quam animi indies suaserit?’ Some of the 
gibberish may be due to the editor’s fadure to decipher the handwriting, but no emendation could 
twist this into unbarbario prose. 
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conquest of Graunt, perhaps had Hampshire friends who were friends of the 
Graunt family, perhaps talked about political arithmetic. We have no evidence 
at all. If the Gresham Professor of Music had duties, Petty did not perform them ; 
about the time of his appointment he obtained leave of absence from Brasenose 
and within a year (in 1662) had left for Ireland, where he was to be very busy 
for some time to come and to make, or found, his material fortunes. 

Macaulay (chap, m) says that at the end of the Stuart period the greatest 
estates in the kingdom very little exceeded twenty thousand a year. 

The Duke of Ormond had twenty- two thousand a year. The Duke of Buokingham, 
before his extravagance had impaired his great property, had nineteen thousand six 
hundred a year. George Monk, Dulte of Albemarle, who had been rewarded for his eminent 
services with immense grants of crown land, and who had been notorious both for covetous- 
ness and for parsimony, left fifteen thousand a year of real estate, and sixty thousand 
pounds in money, which probably yielded seven per cent. These three Dukes were supposed 
to be three of the very richest subjects in England. 

In 1686 Petty made his Will. This Will is a curiously interesting document, 
because it is also an autobiography. It is rich in arithmetical statements and, 
like much of Petty’s arithmetic, the statements may be optimistic. Petty’s final 
casting of his accounts is in this fashion: ‘Whereupon I say in gross, that my 
reall estate or income may be £6,600 per ann. my personal! estate about £46,000, 
my bad and desparate debts, 30 thousand pounds, and the improvements may 
be £4000 per arm., in all £18,000 per ann. ui supra.’ 

The details of the calculation are perplexing enough ; still if the above cited 
dukes were the richest subjects of the king and if (Macaulay) ‘the average income 
of a temporal peer was estimated by the best informed persons, at about three 
thousand a year ’, Sir William Petty, of the year 1686, had travelled as far from 
the young Oxford professor of 1650 as that budding physician from the little 
English cabin boy who spoke Latin and Greek, in Caen, in 1638. The details of 
the fortune-building are not our concern. The shortest account is Petty’s own in 
his Will. He says that by the end of his Oxford career he had a stock of four 
hundred pounds and received an advance of one hundred more on setting out 
for Ireland. 

Upon the tenth of September, 1662, 1 landed att Waterford, in Ireland, Phisitian to the 
army, who had suppressed the Rebellion began in the year 1641, and to the Generali of the 
same, and the Head Quarters, at the rate of 20s. per diein, at which I continued, till June, 
1669, gaming by my practice about £400 per annum, above the said sallary. About 
September, 1664, 1, perceiving that the admeasurement of the lands forfeited by the fore- 
mentioned Rebellion, and intended to regulate the satisfaction of the soldiers who had 
suppressed the same, was moat insufficiently and absurdly managed, I obtained a contract, 
dated the 11th. of December, 1664, for making the said admeasurement, and by God’s 
blessing so performed the same as that I gained about nine thousand pounds thereby, which 
with the £600 above mentioned, my saUary of 20s. per diem, the benefit of my practice, 
together with £600 given me for directing an after survey of the adventrs lands, and £800 
more for 2 years sallary as Clerk of the Coimcell, raised me an estate of about thirteen thou- 
sand pounds in ready and reall money, at a time, when, without art, interest, or authority, 

Biometrika xxxii 8 
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mon bought as much lands for 10s, in reall money as in this year, 1686, yield 10s. per ann. 
rent above his Mattes quitt rents {The Life of Sir William Petty, by Lord Edmond Fitz- 
manrice, London 1895, p. 319). 

No one would willingly rake over the embers of Irish history — still glowing 
after nearly three hundred years. Petty believed himself to be a good man 
struggling against adversity and a public benefactor treated with gross injustice 
to the day of his death. Lecky {History of Ireland, vol. 1, chap. 1, p. Ill of 
popular edition) took a less favourable view. Even if the subject were relevant . 
to my undertaking, which it is not, I have not the training in historical research 
to justify me in writing about it. There are, however, some points of psycho- 
logical interest. 

Petty did not, like his contemporary Thomas Sydenham, actually take up 
arms against the king, but he was even more plainly a protegd of the king’s 
enemies. Sydenham’s military career was unimportant; there is no reason to 
believe that he ever exchanged a word with a member of the Cromwell family. 
Petty was the confidential adviser and close personal friend of Henry Cromwell ; 
his services to the Commonwealth authorities were the foundation of his fortune. 
Lilre many people who have social gifts he had the gentle art of making enemies. 

Pepys, Aubrey and Evelyn concur in the judgment that Petty was a most 
entertaining companion. Evelyn says he was a wonderful mimic. He could 
speak ‘now like a grave orthodox divine ; then falling into the Presbyterian way ; 
then to Eanatical, to Quaker, to Monk, and to Eriar and to Popish Priest’. The 
gift he exercised among his friends. 

My Lord D. of Ormond once obtained it of him, and was almost ravished with admira- 
tion; blit by and by he fell upon a serious reprimand of the faults and miscarriages of some 
Princes and Governors, which, though he named none, did so sensibly touch the Duke, 
who was then Lieutenant of Ireland, that ho began to be very uneasy, and wished the spirit 
layed, which he had raised ; for he was neither able to endure such truths, nor could but be 
delighted. At last he turned his discourse to a ridiculous subject, and come down from the 
joint-stool on which he had stood, but my lord would not have him preach any more 
(Evelyn). 

My lord Duke was not the first or last person to fail to relish a joke against 
himself. 

In The Londoners a challenged party names garden hoes as the weapons. 
That was Mr Robert Hichens’s fun. In real life, Petty, challenged to mortal 
combat by a Cromwellian soldier, pleaded his myopia and demanded that the 
duel should take place in a cellar and the weapons be axes. 

A man like this makes friends or at least admirers, also enemies. Long before 
the king enjoyed his own again, Petty had a host of enemies. When the king 
returned, one might have expected that Petty’s position would be critical. 
According to his own account fic-did lose something, but he was knighted and the 
losses, such as they were, did not seem to stay the growth of his fortune. At the 
Restoration he was already prosperous and he died wealthy; Perhaps the 
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explanation is that Petty was really as great a public benefactor as he thought 
he was. Perhaps the reason is personal. King Charles loved wits (in the old and 
new sense of the word) and Petty was a wit. The scanty specimens of what 
Petty’s modern representative calls ‘Rabelaisian’ printed from the Petty papers 
would not have appealed to such a connoisseur in this genre as the king — ^we 
know from Halifax that the king liked to be the raconteur in this field and indeed 
repeated himself often — but he would have relished a good mimic. Still more 
important might have been their common virtuosity. 

Charles was interested in experimental science, and although Petty certainly 
knew more than the king, he may not have known very much more. Neither 
Charles nor James would have been able to find more common ground with 
Isaac Newton than in a later age Bonaparte found with Laplace. But the 
ingenious Dr Petty, who had resuscitated half -hanged Ann Green (which would 
be a capital story if well told), invented an unsinkable ship, had a dozen plans 
for doubling the king’s revenue, and knew something of everything, probably 
did more than Wilkins to interest the king in the new society of virtuosos (how 
the Icing must have relished the story of the planting of horns in Goa*), and he 
may incidentally have interested the king in his business affairs. This is all 
speculation; what is sure is that when Petty was back in London and able to 
renew personal intercourse with John Graunt, their relation was no longer that 
of client and patron. For a few years more, Graunt was to be a solid merchant, 
but before long Petty was the patron and Graunt the client. 

At this point it will be convenient to conclude the biographical facts relating 
to Graunt. I take them mainly from Aubrey. 

Graunt continued to be a prosperous city tradesman for many years after 
his first meeting with Petty. ‘He was’, says Aubrey, ‘a man generally beloved; 
a faithful friend. Often chosen for his prudence and justice to be an arbitrator; 
and he was a great peace-maker. He had an excellent working head, and was 
facetious and. fluent in his conversation.’ Pepys thought as well of Graunt as did 
Aubrey, admiring both his conversation and his collection of prints — ‘the best 
collection of anything almost that ever I saw’. 

Prom the Restoration for several years Graunt figures in London intellectual 
society (he was elected P.R.S. in 1663), but a material calamity was at hand. 
The Fire of 1666 no doubt caused Graunt direct financial loss; this might have 
been repaired. But, although brought up in Puritan ways, ‘he fell’, to quote 
Aubrey, ‘to buying and reading of the best Socinian bookes, and for several! 

* Sir Philiberto Vecnatti, Resident in Batavia, had certain inquiries sent him by order of the 
Royal Society. The eighth question was: ‘What ground there may be for that Relation, concerning 
Homs taking root, and growing about Goa?’ This is Sir Philiberto’s answer: ‘Inquiring about this, 
a friend laughed, and told me it was a jeer put upon the Portuguese, because the women of Goa are 
counted much given to lechery’ (Sprat’s History of the Boyal Society of Lcmdm, 2nd ed. London 
1702, p. 161). 
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3 ars continued of that opinion. At least, about. . .he turned a Roman Catho- 
:jue, of which religion he dyed a great zealot.’ 

Graunt’s path to Rome was similar to that of young Edmund Gibbon, but 
le results on the career of a city tradesman in the days of Oates triumpJians 
ere more serious than a visit to Lausanne. Graunt became bankrupt. His 
ame dropped out of the list of the Royal Society after 1666, and in 1674 he 
Led. There is evidence that in these last years of worldly misfortune, when the 
heel had come full circle since Graunt had secured the Gresham professorship 
)r Petty, Petty helped Graunt. When Petty was in Ireland, Graunt acted in 
)me sort as his London agent, and Petty conceived a plan of settliag Graunt in 
'eland. But (we have, of course, only Petty’s word for this) Graunt was not an 
isy man to help ; it is possible, of course, that he may have resented Petty’s 
dmonitions. ‘You have done amiss in sundry particulars, which I need not 
lention because you yourself may easily conjecture my meanings. However we 
lave these things to God and be mindful of what is the sum of all religion, and 
f what is and ever was true religion all the world over.’ This is an extract from 
letter of January 1673 to Graunt {The Petty-Southwell Correspondence) p. xxix) 
rmted by the late Marquis of Lansdowne. If Lord Lansdowne was right (the 
rhole letter is not printed) in thinking this a reference to Graunt’s conversion 
ar perversion) ‘of which’, says Lord Lansdowne, ‘Petty seems to have dis- 
pproved on temporal rather than spiritual grounds’, it might have hurt a 
ensitive man. 

Graunt died on Easter Eve 1674 and was buried the Wednesday following in 
it Dunstan’s church in Pleet Street. ‘A great number of ingeniose persons 
ttended him to his grave. Among others, with teares, was that ingeniose great 
drtuoso. Sir William Petty, his old and intimate acquaintance, who \vas sometime 
I. student of Brasenose College.’ Sir William outlived his friend thirteen years 
md lies in Romsey Abbey. Until a descendant in the nineteenth century (the 
ihird Marquis of Lansdowne) erected a monument, ‘not even an inscription 
ndicated that the founder of pohtioal economy lay in Rumsey Abbey’- (Fitz- 
naurice, p. 315). 

Graunt had a son who died in Persia and a daughter who, according to 
Aubrey, became a nun at Ghent. Nothing is known of descendants. 

Petty’s -widow was raised to the peerage and her elder sons, Charles and 
Eenry, died without issue. But the title was revived in favour of the grandson 
of John Pitzmaurice, the second survi-ving son of Thomas Eitzmaurice, Earl of 
Kerry, who, as the above-mentioned grandson remarked, had ‘married luckily 
For me and mine, a very ugly woman who brought into his family whatever degree 
of sense may have appeared in it, or whatever wealth is likely to remain in it’. 
This ill-favoured woman was Petty’s daughter Anne, to whom her father wrote : 

My pretty little Pusling and my daughter Ann 
That shall bee a counteaae, if her pappa can. 
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The cynical grandson was George Ill’s prime minister and afterwards his bite 
noire, ‘The Jesuit of Berkley Square’ and first Marquis of Lansdowne. 

Of the two friends, one has left an intellectual monument only ; descendants 
of the other have been famous in English history. 

Of these, best known are the first and third Marquises of Lansdowne, 
William (1737-1805) and Henry (1780-1863). Of the first marquis, much better 
known as Lord Shelburne (the title created for Lady Petty), every schoolboy — 
not only Macaxilay’s schoolboy — has heard; the quarrel between Charles Eox 
and Shelburne, the party split, the coalition ministry and so on. Schoolboys who 
have reached the sixth and Lecky’s History of England in the Eighteenth Century, 
know a little more. Shelburne, who had much more than a tincture of his 
great-grandfather’s ability and applied himself to economic studies, was one of 
the earliest to appreciate the importance of Adam Smith and was highly thought 
of by two good . judges of scientific ability, Benjamin Eranklin and Jeremy 
Bentham-. 

As a public man, no parliamentary statesman before or since obtained so 
universal a dislike, a positive hatred shared by those who knew him and those 
who did not. 

There is certainly nothing in the actions of Shelburne to justify this extreme un- 
popularity. Much of it was, I believe, simply due to an artificial, overstrained, and affectedly 
obsequious manner, but much also to certain faults of character, which it is not difficult to 
detect. Most of the portraits that were drawn of him concur in representing him as a harsh, 
csmical, and sarcastic judge of the motives of others; extremely suspicious; jealous and 
reserved in his dealings with his colleagues ; accustomed to pursue tenaciously ends of his 
own, which he did not frankly communicate, and frequently passing from a language of 
great superciliousness and arrogance to a strain of profuse flattery (Lecky, 5, 130). 

How far some of these characteristics may be recognized in Shelburne’s 
ancestor, we shall inquire in due course. 

The contrast between Malagrida* and his son Henry is shattering. It is this 
Marquis of Lansdowne of whom nearly everybody thinks when he sees the title 
in a book, and rightly so. Walter Bagehot wrote: 

You may observe that when an ancient liberal, Lord John Russell, or any of the 
essential sect, has done anything very queer, the last thing you would imagine anybody 
would dream of doing, and is attacked for it, he always answers boldly, ‘Lord Lansdowne 
said I might' •, or if it is a ponderous day, the eloquence runs, ‘A noble friend with whom I 
have had the inestimable advantage of being associated from the commencement (the 
infantile period I might say) of my political life, and to whose advice,’ etc,, etc., etc. — and a 
very cheerful existence it must be for ‘my noble friend’ to be expected to justify — (for they 
never say it except they have done something very odd) — and dignify every aberration. 
Still it must be-a beautiful feeling to have a man like Lord John, to have a stiff, small man 

* Malagrida was an Italian Jesuit settled in Portugal who was burned in 1761. The supposed 
Jesuitical propensities of Shelburne led to the name becoming his popular title. Hence Goldsmith’s 
unintended mot : ‘ Do you know that I never could conceive the reason why they call you Malagrida, 
for Malagrida was a very good sort of man.’ 
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bowing down before you. And a good judge (Sydney Smith) certainly suggestod the oon- 
ferrhig of this authority. ‘ Why do they not talk over the virtue-s and oxcellencios of 
Lansdowne? There is no man. who performs the duties of life better, or fills a high station 
in a more becoming manner. He is full of knowledge, and eager for its acquisition. His 
remarkable politeness is the result of good nature, regulated by good sense. He looks for 
talents and qualities among all ranks of men, and adds them to his stock of society, as a 
botanist does his plants; and while other aristocrats are yawning among stars and garters, 
Lansdowne is refreshing his soul with the fancy and genius which he has found in odd places, 
and gathered to the marbles and pictures of his palace. Then he is an honest politician, a 
wise statesman, and has a philosophic mind’, etc., etc. Here is devotion for a carping critic ; 
and who ever heard before of bonhomie in an idol? (Bagehot, Works, 2, 64-S). 

Of the father, Atticus (an alias of ‘Junius’) wrote: 

The Earl of Shelburne had initiated himself in business by carrying messages between 
the Earl of Bute and Mr. Eox, and was for some time a favourite with both. Before he was 
an ensign he thought himself fit to be a general, and to be a leading minister before he ever 
saw a public office. The life of this young man is a satire on mankind. The treachery which 
deserts a friend, might be a virtue compared to the fawning baseness which attaches itself 
to a declared enemy (Letters of Junius, Wade’s edition, 2, 248). 

Naturally justice was no more to be expected in eighteenth-century news- 
paper diatribes than in the twentieth century, but a clever caricaturist does not 
represent Charles Pox as a living skeleton. Those who attacked the son — there 
were such people — took a different line, as Bagehot hints. Perhaps even in his 
very different character something of the ancestral Petty survives. We shall try 
to discover what this was. 

Porty years ago Hull brought out an edition of Petty’s tracts in which he 
included Graunt’s work. In 1927 the fifth Marquis of Lansdowne printed a 
selection from the Petty papers and in 1928 the correspondence between Petty 
and his wife’s cousin,* Sir Robert Southwell {The Petty -Southwell Correspondence, 
edited by the Marquis of Lansdowne, London 1928). 

We shall have to examine in detail both the ‘works ’ and the ‘papers ’, but, 
as a light upon the character of Petty, the Southwell correspondence is the 
strongest we have, Southwell himself was some generations farther away from 
adventuring than Petty. He came of an ‘undertaker’ stock — the adventurers in 
Ireland of Queen Elizabeth’s time — and his father was vice-admiral of Munster 
before him. He was born in 1636 (died in 1702), regularly educated (Queen’s 
College, Oxford and Lincoln’s Inn), knighted in 1666, for some time Clerk of the 
Privy Council, in the diplomatic service, held other offices, was a member of 
parliament and eventually settled in a country house near Bath. He was 
President of the Royal Society 1690-6. He might be described as a lesser 
William Temple ; better educated and less selfish, not so able, but with the same 
cool, cautious judgment ; a psychological antithesis of his correspondent. 

* Petty married in 1667 Lady Fenton, widow of Sir Maurice Fenton and daughter of Sir 
Hardresa Waller who, knighted in 1629, fought for the Parliament and waa one of the King’s 
judges; he was a 7uajor general in Ireland in 1650-1 and a patron of Petty there. 
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The correspondence covers the eleven years 1676-87. Both men were, even 
by modern standards, middle aged. They write one to another with complete 
frankness; there is a remarkable absence of the elaborate verbal formalities 
which in seventeenth-century and even eighteenth-century letters are so 
wearisome. 

Petty’s side of the correspondence consists roughly of domesticities 10 parts, 
eager accounts of his quarrels and law suits concerning money 40 parts, discussion 
of papers or projected papers 40 parts, add autobiographical boasting to make 
up the 100. 

In the purely domestic part of the correspondence, Petty is seen as a kind, 
good-natured father interested in the doings of his relations by marriage, also 
as a very bad judge of others’ feelings. I remember to have read an unpublished 
letter by the famous Edwin Chadwick, the great and very unpopular sanitarian 
of a century ago. It was written to a friend whose wife had just died of puerperal 
fever. Chadwick expressed regret in the shortest possible formula and assured 
his correspondent that the best solace he could have would be to assist in 
pushing forward a bill (which I think he enclosed) to promote some sanitary 
reform which would have the effect of making it less likely that other men would 
lose their wives in childbed. I remember thinking that, however sensible the 
recommendation, the man who gave it was not likely to bring much comfort to 
his friend. 

Petty was very much lilre Chadwick here. Southwell lost his wife in 1681 
and Petty condoled with him as follows; 

When your good father dyed, I told you that hee was full of years and ripe fruit, and 
that you had no reason to wish him longer in the paines of this world. But I cannot use 
the same Argument in this Case for yotu? Lady is taken away somewhat within half the 
ordinary age of Man and sooir after you have been perfectly married to her; for I cannot 
believe your perfect union and assimulacon was made till many years after the Ceremonies 
at Kinsington. 

What I have hitherto said tends to aggravate rather than mitigate your sorrow. But 
as the sun shining strongly upon burning Coles doth quench them, so perhaps the sadder 
Sentiments that I beget in you may extinguish those which now afflict you. The next Thing 
I shall say is. That when I myself married, I was scarce a year younger then you are now, 
and conserpiently do apprehend That you have a second Crop of Contentment and as much 
yet to come as ever I have had. 

This remark, curiously enough, was not well received. 

You doe not onely condole the great loss I have sustained in a wife, but you seeme to 
think it reparable. . . . But when by 19 yeares conversation I Icnew the greate vertues of her 
mind, and discover since her death a more secrett correspondence with Heaven in Acts of 
Pietye and devotion (which before I knew not of), you will allow me, at least for my 
Children’s sake, to lament that they have too early lost their guide. 

Petty could not, it seems, understand that Southwell was wounded and 
returned to the charge in a letter which is lost. That letter provoked a reply 
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which even Petty could not misunderstand and elicited an apology {Correspon- 
dence, p. 90). 

Petty was quite incorrigible. A few years later Southwell had another family 
bereavement and is condoled with in the following terms : 

That by the death of your Father, Mother and Sister, of Sir Edward Deering and your 
three nephews, you are the Head and Governor of both Familyes. That by the death of 
Rupe, Ingenious Neddy culminates ; and by that of your Excellent Lady you are entitled to 
that million I mentioned of unraarryed teeming Ladyos. 

Once again, Southwell was not comforted. ‘Cousin, you doe wipe off Teares 
at a very strange rate, but wliy did nature furnish Them if there must be no 
Sorrow 1 ’ 

Petty had a very quick perception of when and where his shoe pinched, but 
no imaginative sympathy. 

Passing to Petty’s financial affairs and lawsuits, the position was this. By 
original grants, by purchase and in various ways, Petty had widely scattered 
Irish interests. Questions of the validity of the original grants, of rent charges 
due to the crown or to other grantees, of matters of fact and matters of law were 
endless. Petty saw himself steadily as a great public benefactor harassed by 
scoundrels, and it never occurred to him even as a theoretical possibility that 
others had rights. Of his manner of proceeding the editor of the correspondence 
gives a typical example {Correspondence, p. 90). In 1681 Petty gave evidence 
before Lord Chief Baron Hen as to ‘Soldier’s land’ which he had bought in 
Kerry and, it seems, the court decided against him. 

Petty gave vent to his chagrin in a long and scurrilous lampoon against the offending 
judge, entitled: ‘henbalogie or the legend of Hen-Hene and Pen-Hene’, in two parts. 
Whereof the first doth in 24 chapters of Raillery, contain the enchantements, metamor- 
phoses and merry conceits relating to them. The second part contayning (in good earnest) 
the foolish, erroneous, absurd, malicious and ridiculous ‘judgements of hbn-HENe’. 
Fortunately perhaps for the repute of its author, this diatribe was never made public. 

Fortunately, also, for a more material reason; it would probably have led 
to a second incarceration for contempt of court. 

Southwell evidently viewed his good cousin’s proceedings with a mixture of 
gentlemanlike annoyance and practical minded contempt. He expressed these 
feelings more than once; the following extract from a letter of 1677 is typical; 
the particular suit in progress to which reference is made was a claim for £6000 
in respect of a sum of £2500 actually advanced by Petty to the Farmers of 
Revenue. 

And suffer from me this expostulation, who wish your prosperity as much oa any man 
living; and having opportunities to see and hoare what the temper of the world is towards 
you, I cannot but wish you well in Port, or rather upon the firm Land, and to have very 
little or nothing at all left to the mercy and good will of others. For there is generally 
imbibed such an opinion and dread of your superiority and reach over other men in the 
wayes of dealing, tliat they hate wliat they feare, and find wayes to make him feare that is 
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feard. I doe the more freely open, my soul to you in this matter, because tis not for the 
vitells that you contend, hut for outward Limbs and accessions, without which you can 
subsist with Plenty and Honour. And therefore to throw what you have quite away, or at 
least to put it in dayly hazard onely to make it a little more than it is, Is what you would 
condemne a thousand times over in another. And you would not think the Reply sufficient 
that there was plain Right in the Cause and Justice of their side, for iniquities will abound 
and the world will never be reformed. 

After all this is said, I mean not that you should relinquish the pursute of your 2600£, 
which is money out of your Pockett and for which you are a Debtor unto yom Eamily. 
But for other pretensions, lett them goe for Heaven’s Sake, as you would a hott coale out 
of your hand : and strive to retire to your home in this Place, where you had the respect of 
all, and as much quiet as could be in this life, before your medling with that pernicious 
business of the Earme. 

There is no reason to suppose that Petty ever took such sensible advice. Yet, 
somehow, he kept his head well above water. 

In the later part of the correspondence Petty indulges in that complacent 
financial retrospect which he inserted in his WiU and I have, perhaps too harshly, 
described as autobiographical boasting. It is possible that Southwell had heard 
of these financial triumphs rather often ; at least there is a hint of this in the 
following : 

I will onely note that since you are soe Indulgent as to think me worthy of being your 
Depositary in this great Audit, and expect by the Course of .Nature that I should speake 
when you are Silent, you must allow me Kberty without blame to aske questions when you 
seeme defitient or Redundant. 

That you are defitient may be suggested when, on the fortunate syde, I find noe Item 
for my Lady or of the hopefull stock she has brought you (p. 227). 

The shrewd thrust of the last sentence was deadly. The subject does not 
recur. 

I have indicated the character of the non-scientific part of the correspondence 
because we must examine Petty’s scientific writings in greater detail. I think, 
however, we have enough to justify a provisional diagnosis of Petty’s psycho- 
logical type. 

In literature and in life the perennial boy is often encountered. But while 
Peter Pan and Mr Reginald Fortune make far more friends than foes, that is not 
so true of their living counterparts. The exuberant flow of ideas and schemes, the 
intense and restless interest in everything which is characteristic of the clever 
child, often is extraordinarily attractive when it is associated with and con- 
trolled by the trained intelligence of a man. But the bad as well as the good 
points of a childlike or adolescent souP are to be brought into the account. The 

* The first Marquis of Halifax said of King Charles that ‘ his inolinations to love were the efieots 
of health and a good constitution,' with as little mixture of the seraphic part as ever man had ’, and 
Petty held that the King was typical. In The Petty Papers (no. 93 of vol. 2) there is a memorandum 
headed ‘ Californian Marriages with the Reasons thereof’ . ‘ In California ’ , says Petty, ‘ 6 men were 
oonjugerted to 6 women in order to beget many and well conditioned children, and for the greatest 
venereall pleasure, in manner following, viz.’ 

He then sets out the plan. One man ‘exceUing in strength, nimbleness, beauty, wit, courage 
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clever child is often naively and intensely selfish, and so remains as the eternal 
boy ; his quite crude and unashamed egoism, his inability to understand that 
others have feelings and even rights, repel as strongly as his intellectual freshness 
attracts. How far he is a success in life depends on which way the balance 
turns. 

Petty seems to me a good example of tliis psychological type; its good points, 
the restless energy and exuberant flow of ideas, were sources of strength in such 
a time as that of the Civil War and Restoration, which, particularly the Restora- 
tion period, was in virtues and vices an age of grown-up children. Indeed his 
emotional adolescence may have shielded him from the deadly enmity of real 
men. Its bad points made him enemies, but they were children Hire himself, 
nearly a century later, in a time of adults, these same charaeteristics, restless 
intellectual energy and vanity, exhibited by one no longer a roUicking adventurer 
but a great landowner, produced an unfavourable balance and we have ‘Mala- 
grida’. In Malagrida’s son, one has a change; the attractive traits, the eager 
interest in all sorts of things is still there, but the childish hungry vanity has 
been softened or sublimed. The cynic may say that it was easy for a great Whig 
lord 150 years ago to be agreeable, to keep himself hors concours; perhaps it was, 
although the Dropmore Papers raise doubts. The fact, however, is certain. In the 
third Lord Lansdowne one sees the good and in the first the bad effects of the 
perennial boyishness of the ancestor. The ancestor lived in a state of society 
where the good points out weighted the bad points. That is why, although he 
made enemies and was often vexed, he was able to view his career with com- 
placency and to bequeath a great fortune. But it is not Petty as a man but 
Petty as a scientific worker who is the proper object of my study. 

How far does the psychological make-up which, as I think, characterized 
Petty conduce to scientific investigation? We might expect that it would be an 
immense stimulus to pioneering, that such a man would direct attention to a 
number of problems which deserved study, but that it would not lead to the 
production of any solid contribution to knowledge. Our task is to examine in 
some detail Petty’s scientific work. 

and good sense’ subsequently called the Hero, is allowed four women for his sole use, One Groat 
Rich Woman is allowed five men who are to serve her when she pleases, but another woman is 
allotted to the five men for use in common by the five. 

It may ho said this fable is only an after dinner jest— perhaps that ia the whole explanation. 
But Petty does go to the trouble of financial calculations, and does seem to suggest a serious con- 
sideration, {‘The encrease of children will be great and good.’ ‘No controversy about joynture, 
dower, maintenance, portion etc.’) Nobody emotionally adult would be likely to make Californian 
Marriages a basis for practical statecraft. 
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II. PETTY’S SCIENTIFIC WORK 

It is no part of my undertaking to survey the whole of Petty’s scientific 
activities, but to speak only of his medical and vital statistical work. 

In Hull’s edition of Petty’s writings, the editor discusses Petty’s status as an 
economist and remarks that Petty’s view that value depended upon labour was 
probably derived from Hobbes. The corn rent of agricultural lands was in Petty’s 
view determined by the excess of their produce over the expenses of cultivation, 
paid in corn, and the money value of the excess will be measured by the amount 
of silver which a miner, working for the same time as the cultivator of the corn 
land, will have left after meeting his expenses with a part of the silver he secures 
(Hull, p. Ixxiii). Why there should be any surplus, he explains by density of 
population. 

Prof. Hull refrained from attempting to assess Petty’s work in terms of 
modern economic theory. A mere medical statistician wiU naturally follow this 
example. More than a century ago, Mr Chainmail had learned from Mr MacQuedy 
that the essence of a safe and economical currency was an interminable series of 
broken promises and added: ‘There seems to be a difference among the learned 
as to the way in which the promises ought to be broken; but I am not deep 
enough in their casuistry to enter into such nice distinctions.’ Medical statisti- 
cians may well adopt Mr Chainmail’s modest attitude towards the whole field 
of economic theory. Confining ourselves to statistics, we must consider what 
Petty thought should be done and what he actually did himself. 

Under the first heading, praise can be unstinted. More than 160 years before 
the establishment of the General Register Office, Petty specifically proposed the 
organization of a central statistical department the scope of which was wider 
than that of our existing General Register Office. It was to deal not only with 
births, marriages, burials, houses, the ages, sexes and occupations of the people, 
but with statistics of revenue, education and trade (see The Petty Papers, 1, 
171-2). He did not confine himself to vague recommendations, but drew up an 
enumeration schedule to be used for each parish. On this was to be entered: 
The number of housekeepers and of houses ; the number of hearths ; the number 
of statute acres; the number of people by sex and in age groups, viz. under 10, 
between 10 and 70, over 70; for males those aged 16 to 60, and for females those 
between 16 and 48 and how many of these latter were married; how many 
persons were incurable impotents and how many lived upon alms. This, it will 
be noted, is a better enumeration schedule than any used in England before the 
census of 1821. Further in his notes (printed in The Petty Papers) are various 
suggestions for the utilization of data collected in this way. 

The most striking is this : ‘The numbers of people that are of every yeare old 
from one to 100, and the number of them that dye at every such yeare’s age, 
do shew to how many yeare’s value the life of any person of any age is equivalent 
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and consequently makes a Par between the value of Estates for life and for 
years’ (The Petty Papers, 1, 193). 

This is, I think, the most remarkable thing Petty ever wrote, for it suggests 
that he had grasped the principle of an accurate life table, viz. a survivorship 
table based upon a knowledge of rates or mortality in age groups. No such table 
was constructed from population data until the end of the eighteenth century, 
because until then data of the age distribution of the living population were not 
obtained. Whether Petty also realized that under certain conditions a life table 
could be constructed without knowledge of the ages of the living population is 
a controversial matter which I shall discuss later on. 

Then he makes suggestions which are relevant enough to modem demo- 
graphic problems. 

By the proportion, between, marriages and births, and of mothers to births, may be 
learnt what hindrance abortions and long suckling of ohildren is to the speedier propagation 
of mankind; as also the difference of soyles and ayres to this foeoundity of women. 

By the proportion between maryd and urunaryd teeming women, may be found in what 
number of yeeres the present stock of people rhay bee encreased to any number assigned 
answerable to the defect of the peopling of the nation for strength or trade. 

There are not wanting some suggestions which imply that even if Petty’s 
opinion of the Faculty were higher than that of Sydenham (whom we honoured 
posthumously) it was tinged with scepticism. 

Whether they [viz. fellows and licentiates of the College of Physicians] take as much 
medicine and remedies as the like number of any other society. 

Whether of 1000 patients to the best physicians, aged of any decade, there do not die 
as many as out of the inhabitants of places where there dwell no physicians. 

Whether of 100 sick of acute diseases who use physicians, as many die and in misery, 
as where no art is used, or only chance. (The Petty Papers, 2, 169-70.) 

This statistical experiment has not yet been performed and indeed might be 
hardly so conclusive as Petty implied. 

When one passes from what Petty, suggested to what he actually did himself, 
our praise must be qualified. As Prof. Hull said, he was ‘more than once misled 
into fancying that his conclusions were accurate because their form was 
definite’. 

In judging Petty it is hut fair to contrast him with College contemporaries 
whose names are more honoured by us. Among his contemporaries in the 
College were Thomas Browne and Thomas Sydenham. Browne was a much 
older man than Petty, Sydenham almost his coeval. Of Browne’s quality as a 
physician we know nothing; but his literary influence indirectly — through 
Samuel Johnson — and directly upon generations of readers has been greater than 
that of any other practising medical man. Browne, like Petty, had an enormous 
range of interests and his book learning was greater. But, as we shall see, when 
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he tackles a problem of demography, Petty’s rashest guesses seem by com- 
parison as soberly scientific as an aimual report of the Registrar-General. 

Sydenham was an iconoclast in clinical practice and believed himself to be 
emancipated from the rule of ancient authority. No fantastic arithmetical 
calculations are to be found in Ua writings. In fact, with a single exception 
(Observations Medicae, 2, i), no arithmetic at all. It never seems to have entered 
his mind, although his greatest work purports to give the history of the diseases 
in London through a generation, that the arithmetical statements of the, London 
Bills of Mortality were of any value whatever. 

Sydenham was too wise a man for us to think that he rejected the evidence 
because the data were compiled by illiterate old women. He would have known 
that the sworn searchers had the loquacity of their sex and rank and were likely 
to ask what ‘the doctor said’. He rejected it, because counting and measuring 
things did not come within his purview, just as the first beginnings of pathology 
aiid medical chemistry seemed, to him irrelevant. 

For the most part, Petty’s statistical work was severely practical, but there 
is one excursion into theory which is interesting. It is to be found in a section of 
his tract on the use of what he calls Duplicate Proportion and is reprinted by 
Hull (pp. 622-3). 

Petty states that there are more persons hving between the ages of 16 and 
26 than in any other decade of life. The statement is not true for modern 
populations and was probably not true for the English population of Petty’s 
time. In 1861-71 (before the faU in the birth rate and infant mortality rate) 
there were 6-4 millions living under 10, and 4-0 between 16 and 25), But perhaps 
Petty meant that there were more hving in the decade 16 to 26 than in any later 
decade, in which case his statement was of course right tmless the birth rate was 
faUing. 

He then asserts that the 

Root® of every number of Men’s Ages under 16 (whose Root is 4) compared with the 
said number 4, doth show the proportion of the likelyhood of such men reaching 70 years 
of Age. -As for example: ‘Tis 4 times more likely that one of 16 years old should live to 70, 
than a new bom Babe. ‘Tis three times more likely, that one of 9 years old should attain 
the age of 70, than the said infant. Moreover, ’tis twice «s likely, that one of 16 should reach 
that Age, as that one of four years old should do it; and one third more likely, than for 
one of nine. 

We have no life table for England in 1674. Perhaps the nearest modern 
experience might be the Liverpool Table calculated by Parr seventy years ago. 
According to that table the chance of a new-born child living to be 66 was 
0-0976 and the chance of a person of 15 living to 66 was 0-202, which is about 
double the infant’s chance, not four, times as large. For the Healthy Districts, 
the chances are 0-4246 and 0-64686; that is, in a ratio of 1-28 to 1. 

Petty’s statements are wildly wrong. The interesting point is how did he 
reach them? The only figures he had were printed by Graunt. 
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This 'Life Table’ gives as follows: 

l„ 100 

l« 04 

li6 40 

Lo 25 

Lo 10 

Now if we take 2 as the survivors to 70 (it does not of course matter what the 
numerator is for comparative purposes), then the infant’s chance of surviving 
to 70 is 0-02 and the person of 16 has the chance 1/20 = O-OS, a ratio of 2-5, not 
wildly different from the Liverpool Table figure and very different from 4-0. 

A fortiori when Petty, having passed above age 16, asserts that ‘it is five to 
four, that one of 26 years old will die before one of 16 ; and 6 to 6 that one of 36 
will die before one of 26 we are in a region of pure fantasy because, even if he 
had had the statistical data. Petty would not have had the technical knowledge 
to solve the problem involved, viz. to find the probability that of two lives a ged 
respectively x and y, the former will faU before the latter. 

If we keep within the range of the simple arithmetic which Petty used, the 
result cannot be obtained. 

He then passes to this statement: 

To provo all which I can produce the acoompts of every Man, Woman, and Child, 
within a certain Parish of above 330 Souls; all which particular Ages being cast up, and 
added together, and the Sum divided by the whole number of Souls, made the Quotient 
between 16 and 16; which I call (if it be Constant or Uniform) the Age of that Parish, or 
Nwmrus Index of Longaevity there. Many of which Indexes for several times and places, 
would make a useful Scale of Salubrity for those places, and a better Judg of Ayers than 
the conjectural Notions wo commonly read and talk of. And such a Scale the King might 
as easily make for all his Dominions, as I did for this one Parish. 

The puzzle is to discover why Petty thought this statistical experiment 
proved his point and why he regarded the mean age of the population of a parish 
its index of longevity. The first question I cannot answer at all; about the second 
I can make a guess. If the parish population were supported solely by births and 
there was no migration, then, if the death rates at ages did not vary, the popula- 
tion would be a stationary population and both the mean age of the living and 
the mean age at death would be constant. The expectation of life is greater than 
the mean age of the living unless the rates of mortality at early ages are very 
high and the more favourable the rates of mortality the greater wiU be the 
difference. In Petty’s day, when mortality at early ages was very high, the two 
constants were probably not far apart, but it is certain that both expectation of 
life and mean age of a life table population were greater than 16; probably of 
order 28 to 32. 

I think we may be sure that the parish Petty counted was not stationary in 
the statistical sense, but had an excess of births over deaths, and that his average 
threw no light upon the rates of mortality. 
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Passing to practical statistics, it will be convenient first to note rapidly 
statistical observations which are incidental in treatises of primarily financial or 
economic interest. In the Verbum mpienti, which although not printed until 
1691 was written as early as 1665, Petty attempts to reckon what a man is 
worth. Here is the method. He concludes from financial data that the annual 
proceed of the Stock or Wealth of the nation yields 15 millions, but that the 
expenses of the nation are 40 jniUions. So the balance of 25 millions must be 
derived from the labour of the people. He assumes that the population is 6 
millions and that half of these can work, and earn £8. 6s. 8d. a head per 
annum. This would be Id. a day, abating 52 Sundays and half as many other days 
for sickness, holidays, etc. ‘Whereas the Stock of Kingdom, yielding but 15 
Millions of proceed, is worth 250 Millions; then the People who yield 25, are 
worth 416. 2/3 Millions. For although the Individuums of Mankind be reckoned 
at about 8 years purchase ; the Species of them is worth as many as Land, being 
in its nature as j)erpetual, for ought we know.’ 

Why an individual’s working life is worth only 8 years’ purchase is not clear. 
One would be inclined to put it as the average number of years lived in the 
working period of life. Perhaps Petty took Graunt’s table and worked out the 
average number of years of life lived between the ages of 16 and 56; it is 
nearly 8. 

He then calculates the money loss due to 100,000 dying of the plague and 
makes it nearly 7 millions, adding that £70,000 would have been well disposed 
in preventing this ‘ centuple loss ’. Perhaps this is the first printed statement 
of the neglected truth that public health measures pay. 

Since Petty’s day, others, including Farr himself, have done sums of this 
kind; it is a popular occupation in the United States of America. 

Farr went to work more elaborately, making out a balance sheet of a man 
from the cradle to the grave. But the principle was much the same. We cannot 
say it is a wholly useless pastime. There is of course the difficulty that if more 
lives are saved the price of labour might fall. But to Petty that would have been 
no difficulty, because he held that wealth is purely relative, viz. that if the income 
of each person in a community is halved, everybody is as well off as before. 

In the Political Anatomy of Ireland, Petty seeks to determine war losses in 
Ireland. 

The number of the People being now Anno 1672 about 1,100,000 and Anno 1662 about 
850 M. Because I conceive that 80 M. of them have in 20 years encreased by generation 
70 M. by return of banished and expelled ISnglish-, as also by the access of new ones, 
80 M. of New Scots, and 20 M. of returned Irish, being all 260 M. 

Now if it could be known what number of people were in Ireland Ann. 1641, then the 
difference between the said number, and 860, adding unto it the increase by generation in 
11 years will shew the destruction of people made by the Wars, viz. by tjie Sword, Plague 
and Famine occasioned thereby. 

I find by comparing superfluous and spare Oxen, Sheep, Butter and Beef that there was 
exported above 1/3 more Ann. 1664 than in 1641, which shews there were 1/3 more of 
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people, viz. 1,466,000. Out of which Sum taka what were left Ann. 1652, there will remain 
616,000 destroyed by the Rebellion. 

Whereas the present proportign of the British is ns 3 to H; But before the Wars the 
proportion waa less, viz. as 2 to 11 and then it follows that the number of British slain in 
1 1 years was 112 thousand Souls ; of which I guess 2/3 to have p erished by War , Plague and 
Famine. So as it follows that 37,000 wore massacred in the first year of Tumults : So as 
those who think 164,000 were so destroyed, ought to review the grounds of their Opinions. 

It follows also, that about 604 M. of the Irish perished, and were wasted by the Sword, 
Plague and Famine, Hardship and Banishment, between the 23 of October 1641 and the 
same day 1652. Wherefore those who say, That not 1/8 of them remained at the end of the 
Wars, must also review their opinions; there being by this Computation near 2/3 of them; 
which Opinion I also submit. 

Assuming, which is rash, that the estimates of population in 1672 and 1662 
are correct, the assumption that population varied inversely as exportation of 
cattle seems bold. Might it not be that shipping facilities were better in 1664 
than in 1641 ? Had there been no exportation we could not infer the population 
to be infinite. 

Again Petty has multiplied the estimate for 1672 by 1-333. But he needed 
the population of 1664, which presumably was smaller than that of 1672. If his 
estimate is right, the population was increasing at the rate of about 12-5 
thousands per annum, so he should have multiplied 1,000,000 not 1,100,000 by 
1-333 and has overestimated the 1641 population by 133,330, and therefore the 
number destroyed by the same amount, an overstatement of 20 %. But this is 
not ah. If we assign the decrement of population between 1652 and 1641 wholly 
to sword, plague and famine, we must assume that births continued at the peace- 
time rate ; not a likely assumption. Lastly, it seems unreasonable to assign the 
casualties to the two races in precise proportion to their estimated numerical 
strength in the population of 1641. 

How it follows that 37;000 were massacred in the first year of tumults I do 
not know. 

In a later- work {Treatise of Ireland,, pp. 610-11) Petty has another shot at 
this problem. 

He now assumes that Graunt’s deduction from a Hampshire parish register, 
viz. that christenings are to burials in the ratio of 5 to 4, applies to Ireland, and 
that the death rate is 1 in 30, i.e. about what Graunt estimated for London and 
much higher than his estimate for the country. He then proceeds in this way. 
He estimates the population of 1663 to be 900,000 and that of 1687, 1,300,000. 
Then taking 1/30 for the death rate and 1/24 for birth rate, he makes the 
population of 1662, 985,000, He does not comment on the great decrease be- 
tween 1662 and 1653; but there was still war in Ireland in 1662. 

He now sayS that the population of 1641 waa greater than that of 1687, ‘as 
appears by the Exportations, Importations, Tyths, Grist-Mills and the Judg- 
ment of Intelligent Persons’. This time he takes the population to be 1,400,000 — 
a little less . than, in the earlier estimate — and by the same kind of reasoning 



Major Greenwood 


121 


again makes the war losses to be about 600,000. One is reminded of Hull’s 
remaik that Petty confused the accurate with the definite. Also one notes the 
inevitable tendency of a polemical writer — ^which Petty very decidedly was — 
to maintain his original assertion. Those of us who have never yielded to this 
temptation may cast stones at him. It is not I believe too cynical to say that 
any calculation Petty made would have made the war losses around 600,000. 

Returning to the Political Anatomy of Ireland, we find here a distinct claim 
that the mean age at death (not the mean age of the living) measures longevity. 

As to Longaevity, inquiry must be made into some good old Register of (suppose) 
20 persons, who were all born and buried in the same Parish, and having oast up the time 
which they all lived as one man, the Total divided by 20 is the life of each one with another ; 
which compared with the liko Observation in several other places, will show the difference 
of Longaevity, due allowance being made for extraordinary contingencies and Epidemical 
Diseases happening respectively within the period of each Observation (p. 172). 

Apart from what we should think the absurdity of basing important con- 
clusions upon an average of 20 — and Petty only gives 20 as a figure — the mean 
ages at death of different populations are not comparable unless in each place 
the population is stationary in the sense described above. But, since so acute a 
man as Edwin Chadwick made the same mistake in the nineteenth century as 
Petty in the seventeenth century and it continues to be made in various places 
in the twentieth century, we need not be superior. 

We now come to Petty’s purely statistical work which is concerned with the 
growth of population; before examining this in detail, it will be convenient to 
consider the methods available in the seventeenth century for estimating 
population and notions then current on what may be called the theory of 
population growth. 

It is hard to believe that in the ancient world nobody studied demography 
arithmetically. There is evidence that the Romans enumerated citizens — the 
word census is pure Latin — and it has been suggested that the Romans made 
life tables. Gouraud, cited by Todhunter [History of the Mathematical Theory of 
Probability, p. 14), refers to a passage cited from Ulpian in the Digest which I 
have discussed elsewhere.* The question was of the value of annuities and the 
conclusion I reached was that Ulpian had no vital statistical basis whatever for 
his figures, that he simply began with the capital value the law gave for any 
usufruct and. then, realizing that people do die eventually, made some sub- 
tractions, ending with the absurd (vital-statistically speaking) conclusion that 
after the age of 60 the rate of mortality was independent of age. 

There is not, I think, any reason to bebeve that the practical Romans had 
anticipated Graunt and Petty. 

That is not to say that nobody studied any demographioal problems arith- 
metically. Indeed one fellow of the CoUege of Physicians who has had — and will 

* Joum. Boy. Slat. Soc. 103 (1940), 246. 
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continue to have — a hundred readers for every one reader of Graunt and Petty 
made an elaborate . demographical calculation. This was Sir Thomas Browne. 
Sir Thomas devoted the sixth chapter of the sixth book of Pseudodoxia to the 
vulgar opinion that the earth was slenderly peopled before the Flood. 

This vulgar opinion Sir Thomas found to be very wide of the mark. Indeed, 
far from the earth being slenderly peopled, ‘we shall rather admire how the 
earth contained its inhabitants, than doubt its inhabitation: and might con- 
ceive the deluge not simply penall, but in some way also necessary, as many 
have conceived of translations, if Adam had not sinned, and the race of man had 
remained upon earth immortal’. Indeed Sir Thomas estimates that by the 
seventh century of the world’s history its population amounted to 1,347,368,420. 
He reaches this result in the following way: 

Having thus declared how powerfully the length of lives conduced unto populosity of 
those times, it will yet be easier acknowledged if we descend to particularities, and consider 
how many in seven hundred years might descend from one man; wherein considering the 
length of their dayes, we may conceive the greatest number to have.been alive together. 
And this that no reasonable spirit may contradict, we will declare with manifest dis- 
advantage ; for whereas the duration of the world unto the flood was about 1,600 years, we 
will make our compute in less than half that time. Nor will we begin with the first man, 
but allow the earth to be provided of women fit for marriage the second or third first 
centuries; and will only take as granted, that they might beget children at sixty, and at 
an himdred years have twenty, allowing for that number forty years. Nor will wo herein 
single out Methuselah, or account from the longest livers, but make choice of the shortest 
of any we find recorded in the Text, excepting Enoch : who after he had lived as many years 
as there be days in the year was translated at 366. And thus from one stock of sevenhundred 
years, multiplying still by twenty, we shall find the product to be one thousand, three 
hundred forty seven millions, three hundred sixty eight thousand, four hundred and 
twenty. 

1 . 20 . 

2. 400. 

3. 8,000. 

4. 160,000. 

Century. 6. 3,200,000. 

6. 64,000,000. 

7. 1,280,000,000. 

1,347,368,420. 

Simply as a sum, there are difficulties about this result. If our 20 are equal 
numbers of males and females, it is not 20 ■which should he multiplied by 20 but 
10. If they are all males, then -women are left out of the reckoning. But, per- 
haps, as the Text does not record the ages of -women, Sir Thomas esteemed them 
as ephemerids, sufficiently plentiful ho-wever to pro-yide a -wife for every husband. 
But then I think he should have said that the 20 to be begotten between 60 and 
100 were aU males. Anyhow the sum must he wrong because some of the 
64 ,000, 000 short-lived women of the sixth century should survive into the seventh. 
Indeed Sir Thomas uses his data a trifle capriciously. 

We must surely play a game according to the rules. We are to accept the 
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Text word for word as it stands. But, omitting Adam, whose age at his begetting 
of Cain is not recorded, and Noah, who seems to have reached middle age — 
500 years — ^before becoming a father, the reproductive habits of eight fathers are 
recorded. Two begat males at the age of 66, one at 70, one at 90, one at 106, one 
at 162, one at 182 and one at 187. When this primary business was over, they 
are all recorded to have begotten an unspecified number of sons and daughters. 
So, if we are to be faithful to the Text, a very much more complicated arith- 
metical problem presents itself. A male begets another male at an average age 
of about 100, he then begets males and females at an unspecified rate for say 
another 600 years, required the law of increase. The Text does not authorize Sir 
Thomas to start pre-diluvian breeding at 66 or to stop it at 100. His ‘manifest 
disadvantage ’ is breaking the rules of the game. 

Further, the Text does not entitle him to predicate of the other males the 
lengths of days and procreative exploits of the recorded eight. 

All this, it may be said, is breaking a butterfly upon the wheel. Nobody now 
takes the statistics of the Authorized Version MteraUy. The point is that Sir 
Thomas Browne did, but used them improperly. As Lord Chesterfield said to a 
Garter King at Arms of his day who had not followed the ndes of heraldry, 
‘You foolish man, you don’t Icnow your own foolish business’. 

Petty did not tackle pre-diluvian demography, but he did try his hand at an 
estimate of the world’s population after the flood, ‘To justify the Scriptures and 
all other good Histories concerning the Number of the People in Ancient Time ’ 
(p. 466). 

As Petty was not going to allow the population of ancient times to be greater 
than in the seventeenth century, but to make it increase regularly from the time 
of Noah’s Ark, common sense saved him from fantastic figures, but not from, 
physiological difficulties. The rules of the game obliged him to start with eight 
landed from the Ark, so he thought it best to make them increase and multiply 
very fast indeed at first and progressively more slowly. At first he doubled the 
population every ten years, but by the birth of Christ has brought the period up 
to 1000 years. But doubling every ten years (in the first century from the Flood) 
leads one into difficulties. 

We can allow the possibility of the four pairs emerged from the Ark pro- 
ducing 8 offspring in ten years and so becoming 16 in year 10, without too great 
difficulty. But ten years later they must number 32 and this is a difficulty. If 
the fecundity of the first settlers remains the same they will contribute 8 more 
children, giving us a population of 24, the balance of 8 must come from the four 
couples of children all of whom must be under 20, and this is a little difficult. 

But at least we may say that there is nothing wholly fantastic in Petty’s 
procedure. Petty does belong to a different arithmetical world from that of 
Browne. Here we may leave purely speculative demography. 

To estimate the people of an area without counting them, we must count 
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something which has a connexion with the number of the people. We may count 
the tax-payers, the houses, the burials, the christenings or the acreage under 
corn— all or any of these items vary with the number of people. 

I wish to keep separate the discussions of Petty’s and Graunt’s statistical 
researches, but in the matter now to be examined Petty used some of Graunt’a 
methods and results, so these must be considered. 

Graunt used three methods of estimation. In the first place, he surmised 
that the number of ohM-bearing women in a community might be about double 
the number of annual births ‘forasmuch as such women, one with another, have 
scarce more than one child in two years ’. Then he surmised that families were 
twice as numerous as women of child-bearing age. His reasoning was that 
women between 16 and 76 might be twice as numerous as women between 16 
and 40 or 20 and 44 (i.e. of child-bearing age), and he thought of a family as 
centred round a married couple. Finally, he thought that the average family 
would consist of eight persons, the husband and wife, three children and three 
servants or lodgers. So, starting with 12,000 christenings, which he thought a 
fair measure of annual births, he reaches 24,000 women of fertile age, then 
48,000 families and lastly 384,000 persons. 

It is quite certain that Graunt’s estimate of an annual fertility rate of 500 
per 1000 was an enormous overstatement. In London in 1851, the ratio of 
legitimate births to married women aged 15-45 was 261' 8 per 1000. There is no 
reason to believe that nuptial fertility changed appreciably between. 1660 and 
I860. But an error of this kind would lead him to an understatement of families. 
Now, however, another error saves him. We cannot be so positive that eight to 
the family is a great overstatement aa we can that the marital fertility was not 
600 per 1000, but it is much higher than any nineteenth-century finding. Using 
this multiplier saves Graunt in this sense, that his quaint rule gives almost 
precisely the right answer for the population of London nearly 200 years after 
his time. 

The legitimate births registered in London in 1851 were 75,097. This, ac- 
cording to Graunt’s rule, is to be multiplied by 32. The result is 2,403,104. The 
enumerated population was 2,363,236; the conjecture is only 1-7 % out. Sic me 
sermvit AfoUo. 

Graunt’s next method was experimental and very briefly described. He 
counted the numbers of families in certain parishes within the walls and found 
that ‘3 out of 11 Families per annum have died’. He then multiplies the burials 
for the year (13,000) by 11/3, and proceeds as before. 

Finally, he took Newcourt’s map of London and 

guessed that in 100 Yards square there might be about 64 Families, supposing every House 
to be 20 Foot in the front: for on two sides of the said square there will be 100 Yards of 
Housing in each, and in the two other sides 80 each; in all 360 Yards: that is 64 Families 
in each square, of which there are 220 within the Walla, making in all 11880 Families within 
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the Walls. But forasmuch as there die within the Walls about 3200 per Annum, and in the 
whole 13,000, it follows that the Housing within the Walls is J part of the whole, and conse- 
quently, that there are 47,620 Families in and about London, which agrees well enough 
with all my former computations (p. 386). 

These conjectures led Graunt to think that the rate of mortality in London 
was about 1 in 32. In his first essay on the growth of London (pp. 458-75) Petty 
bases himself upon that estimate, and in the series of papers (pp. 606-44) this 
remains the fundamental method, but Petty allows himself to modify the 
multiplier, not altogether without suspicion of bias. At a quite early stage he 
had satisfied himself that London was the largest city in the world and much 
larger than Paris. This is the kind of argument. For the three years 1682-84, 
the average of burials in London was 22,337 and for Paris 19,887. So if the rates 
of mortality were the same, London was larger than Paris.* If the rate of 
mortality in Paris were higher than in London then the population of London 
must be larger still. According to Petty {a) a larger proportion of the Paris 
population died in hospital, (6) the mortality in hospital was heavier in Paris 
than in London. So it follows that the general death rate of Paris was higher. 

That at London the Hospitals are better and more desirable than those of Paris, for 
that in the best at Paris there die 2 out of 16, whereas at London there die out of the worst 
Boarce 2 of 16, and yet but a fiftieth part of the whole die out of the Hospitals at London, 
and 2/6 or 20 times that proportion die out of the Paris Hospitals which are of the same 
kind ; that is to say, the number of those at London who ohuse to lie sick in Hospitals rather 
than in their own Houses, are to the like People of Paris as one to twenty; which shows the 
greater Poverty or want of Means in the People of Paris than those of London. We infer 
from the premisses, viz. the dying scarce 2 of 16 out of the London Hospitals, and about 
2 of 16 in the best of Paris (to say nothing oiVhostel Dieu) that either the Physicians and 
Chirurgeona of London are better than those of Paris, or that the Air of London is more 
wholesome (p. 608). 

These, however, are only logical deductions if the user of the hospitals in 
London and Paris is identical. If, as implied in the first part of the quotation, 
we think of hospitals in the sense which our elder contemporaries think of the 
old-fashioned poor law infirmaries, viz. as refuges for the sick poor, it would 
mean that in Paris more of the aged indigent died in institutions than in London 
and heavy mortality might well have nothing to do with the skill or lack of skill 
of the medical staff. If we think of hospitals in the modern sense, then heavy 
mortality might be a mere reflection of the resort to these hospitals of persons 
suffering from illnesses which needed special treatment. In any case. Petty can 
hardly have it both ways. In another essay (pp. 510-11) he contrasts the higher 
ratio of deaths to admissions at Fhostel Dieu of Paris with that of la Charity, 
argues that the excess in Fhostel Dieu is unnecessary and proceeds to calculate 

* It should be remembered that the London of Petty’s calculations is the whole area within 
the Bills. The calculations of Graunt described above did not include Westminster or the six out- 
parishes of Surrey and Middlesex which were within the Bills: Islington, Lambeth, Stepney, 
Newington, Hackney, Redrift. 
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what the French nation would gain by saving this excess. But he has not in- 
quired whether the patients of the two institutions were in pari materia. 

Here is an historical problem which might be solved by those familiar with 
the literature of the period. Its discussion would not be relevant here. It is, 
however, only just to Petty to say that, unless conditions deteriorated seriously 
in the following century, his strictures on I’hostel Dieu were justified. In 
Franklin’s work [La Vie Privie d'autrefois. L'llygiene (Paris 1890), pp. 177 et 
aeq.) an appalling account of this hospital from the pen of the eminent surgeon 
Tenon, printed in 1788, is quoted. Tenon’s description of the routine of this 
great hospital compares, unfavourably, with the story of the wounded in the 
Mesopotamian campaign which horrified England in the war of 1914-18. He 
remarks, inter alia, ‘on ne gu^rissoit point de tr^pands autrefois k I’Hotel-Dieu, 
comme on n’en guerit pas encore aujourd’hui’, and cites a court surgeon of the 
time of Louis XIV, i.e. a contemporary of Petty, to that effect. His account of 
the treatment of lying-in women is grotesquely horrible. 

In another essay (pp. 633-6) Petty discusses methods of estimation more 
carefully than in his other papers. 

He proposes to show that the population of London (within the Bills) in or 
about 1685 was approximately 696,000. 

There are, he says, three methods: (1) From houses and families. (2) From 
an estimated death rate. (3) From the ratio of those who die of the plague to 
those who escape. 

This last we may deal with at once. Petty asserts that Graunt had proved 
that one-fifth of the people died of the plague. But in 1666, 98,000 died of the 
plague ; therefore the population was 490,000, and allowing an increase of one- 
third between 1665 and 1686 we reach 663,000, 

Graunt could not have proved that one-fifth of the population died of the 
plague unless he knew what the population was, and he never claimed to have 
done so. 

The other methods (which Graunt used) are rational. 

To estimate houses, Petty used three methods. He says that in the Fire of 
1666, 13,200 houses were burned and that deaths from these houses were one- 
fifth of total deaths, so he reckons the houses to have been 66,000. Then as 
burials in 1686 were to burials in 1666 as 4 to 3, he makes the houses of 1686, 
88,000. He does not, however, say upon what basis the estimate of one-fifth of 
the deaths in 1666 stands. 

Next, he gives an estimate of the houses in 1682 given him by those employed 
upon a map said to have been made in that year. This map has not been 
identified. 

Lastly, he uses the return of hearths. In Dublin in 1685 the hearths were 
29,326 and the houses 6400. In London the hearths were 388,000 ; so the houses 
on the Dublin ratio should be 87,000. In Bristol he says there were 6307 houses 
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and 16,762 hearths, which give 123,000. houses for London; the mean of the 
calculations is 106,000, The Hearth Office itself, he says, certified the number 
to be 106,315. He must now have a multiplier. He accepts Graunt’s multiplier 
of 8 as valid for tradesmen’s families, but allows for smaller famihes among the 
poor and larger among the rich, finally choosing 6. He then allows for double 
fa-mihes in houses by adding 10,631 to his 106,316, and multiplying the sum by 
6 has 696,076 for the population. 

' Petty’s second way was from an estimated death rate. 

Petty multiplies the average of the burials in 1684 and 1685 (23,212) by 30, 
which makes the population 696,360. 

He now essays to prove that the death rate in London was 1 in 30. He uses 
four arguments, of which only one is strictly to the point, viz. Graunt’s direct 
observation that three deaths occur annually in eleven families — ^which however 
involves the assumption of eight persons to the families observed. Two others 
are relevant, viz. observations, apparently direct, that in ‘healthful places’ the 
mortality is 1 in 60 and in nine country parishes 1 in 37. The fourth partly rests 
upon a statement which Graunt did not make, viz. that one of 20 children under 
10 dies annually. This fictitious value Petty averages with the statement of a 
M. Auzout to the effect that the rate of mortahty of adults in Rome is 1 in 40. 
It will be clear that Petty has proved nothing at all. What he has done is to make 
it unlikely that the rate of mortality was less than 1 in 30. That, perhaps, was 
enough. One has a certain sympathy with his round statement: ‘Till I see 
another round number, grounded upon many observations, nearer than 30, 
I hope to have done pretty well in multiplying our Burials by 30 to find the 
number of the People.’ 

With this I may conclude the analysis of Petty’s statistical work. It will, 
I think, soon be clear enough that it is not of the calibre of Graunt’s. Yet I 
caimot take leave of it without something of an ave. Careless, happy-go-lucky, 
tendentious ; yes, all that. But anybody who has felt the exhilaration, to which 
Francis Galton owned, in the doing of sums concerning biological problems, feels 
his heart warmed by the arithmetical knight errant who had so many statistical 
adventures. 


{To be continued) 
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1. IN'JPROBXTOTION 

The theory of confidence intervals was started by the present author about 1930. 
At that time it was taught in lectures given both at the University and at the 
Central College of Agriculture, Warsaw, Poland. The theory found immediate 
practical applications, and before any theoretical paper was published, a booklet 
(Pytkowsld, 1932) appeared giving numerical confidence intervals for means and 
for regression coefficients. The term ‘confidence interval’ is a translation of the 
original Polish ‘przedzial ufnosci’. The author’s theoretical results appeared two 
years later (Neyman, 1934), At almost the same time the first tables and graphs 
of confidence intervals were published (Clopper & Pearson, 1934) in a paper which 
gave a remarkably clear explanation of the difference between the new approach 
to the problem of estimation and the old one, by means of Bayes’s theorem. 

The first publication on fiducial argument (Fisher, 1930) anticipated the booklet 
of Pytkowsld by two years. The present author -overlooked this article for some 
time. However, when preparing his paper of 1934, he was already acquainted with 
it and also with the next paper (Fisher, 1933) on a similar subject. Although 
Fisher’s method of approach was entirely different from the author’s, the 
numerical identity of Fisher’s fiducial limits with the confidence limits in the 
author’s theory, and also some of Fisher’s early comments, suggested to the author 
that the two theories are essentially the same. Accordingly, and owing to the 
difference in dates of publications, the author considered his own woric as an 
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extension of the previous results of Msher. This was clearly stated in the author’s 
paper of 1934. 

Apart from the above points of agreement the author had found certain 
passages and conceptions in the publications of Fisher which were difficult for 
him to understand and to reconcile with what was essential in the theory of con- 
fidence intervals. They included ‘fiducial probability’ and ‘fiducial distribution 
of a parameter ’. However, the author was inclined to think that these were, more 
or less, lapsus linguae, difficult to avoid in the early stages of a new theory. This 
attitude was clearly expressed in the paper of 1934. That paper was read before 
a meeting of the Koyal Statistical Society and was followed by a public discussion 
recorded in the Society’s Journal. Fisher took part in the discussion, and it was 
a great surprise to the author to find that, far from recognizing them as mis- 
understandings, he considered fiducial probability and fiducial distributions as 
absolutely essential parts of his theory. As a result, the author began to doubt 
whether the two theories were, in fact, equivalent. These doubts were only 
increased by Fisher’s insistence that the calculation of fiducial distributions and 
fiducial limits must be limited to cases where sufficient statistics exist (Fisher, 
1936), and by his warnings against inconsistencies in the theory of confidence 
intervals. 

When questioned on the subject, the author could not conceal his doubts and 
they were published (Neyman, 1938a). Subsequent publications by other authors 
appear to be divided. Some, e.g. the very important papers by Wald (1939) and 
by Wald & Wolfowitz (1939), deal with the theory of confidence intervals, entirely 
ignoring fiducial theory. Others (Starkey, 1938; Sukhatme, 1938; Yates, 1939), 
at the other extreme, work on the ground of fiducial argument and ignore the 
confidence intervals. There is also an intermediate group of authors with an almost 
continuous spectrum of opinions. Pitman (1939), in a very interesting paper on 
estimation of location and scale parameters, states that the two theories ‘ are 
essentially the same and that their two points of view are both necessary for a full 
comprehension of the theory of estimation’. And a few pages further: ‘I at first 
called it the fiducial probabihty function, bub finally decided to shorten the name 
by dropping the word “probability 

Next we find the statement (Bartlett, 1939) that ‘ by a distribution of fiducial 
type we shall mean a distribution providing at least confidence intervals in the 
sense of Neyman’, This statement is used in an argument (Bartlett, 1936, 1939) 
that, as a distribution deduced by Fisher (1936) does not seem to provide con- 
fidence limits, there must he some error in the deduction. A similar point of view, 
but with a stronger leaning towards confidence intervals, is expressed by Welch 
(1939). In this paper various general claims of Fisher are analysed, essentially 
from the point of view of confidence intervals, and tested on appropriate examples . 
Among other things it is found that the fears of inconsistencies in the theory of 
confidence intervals are unfounded. 
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A quite different school of thought is represented by Jeffreys {1940), according 
to which the fiducial approach to the problem of estimation is completely equi- 
valent with that by inverse probability. 

Pisher (1937, 1939a, 19396) and Yates (1939) emphatically deny that there 
is an error in Fisher’s paper of 1936. On the contrary, it is said that the results 
then published were obscured by the controversy arising from Bartlett’s con- 
fusion about the nature of fi.ducial argument. Also, especially in earlier papers 
(1930, 1933, 1936), Fisher is equally emphatic on the distinction between the 
fiducial and the inverse probability approaches to the problem of estimation. 

The above survey shows that there is an interesting divergence of opinions as 
to what is essential in the fiducial theory in general and as to whether it is in any 
way connected with the theory of confidence intervals. The perusal of .all the 
literature quoted does not allow the present author to form any precise opinion 
as to the first of these questions. On the other hand, there now seems to be sufficient 
ground for answering the second, concerning the relationship between the two 
theories. The purpose of the present paper' is to show that there is none. The 
relevant points concerning this question, which were possible to establish on the 
ground of earlier literature, are explained in excellent papers by Pearson (1939) 
and Welch (1939), with the final conclusion that, in spite of various differences, 
the two theories are closely related. However, fresh evidence provided by papers 
of Fisher (1939 a, 19396) and Yates (1939) shows that no such relation exists and 
that the authors suspecting it were misled by the incompleteness of earlier writings 
concerning fiducial argument. 

As a result of the present paper it may be found expedient, for the sake of 
clarity, to avoid confusion of terminologies appropriate to the two theories. 
Instead of writing, as some authors do, on ‘ fiducial or confidence ’ limits, it may be 
preferable to discuss ‘fiducial limits’ or ‘confidence limits’, as the case may be, 
separately. 


2. Basic ideas in the theory of confidence intervals 

The key to understanding the theory of confidence intervals is in being clear 
about what might be called the classical point of view in the theory of probability. 
This theory was originally built up to answer questions about how frequently a 
given combination of throws will occur in a long series of games of dice. Thus; the 
probability of a certain combination found to be, say, 1/6, implies that this com- 
bination would appear in about 20 % of a long series of actual games. This agree- 
ment may, but need not, be observed. In the latter case, we would say that the 
assumptions underlying the deduction were not realized by the actual experi- 
ments. The dice used were perhaps ‘ biased’, and so forth. The point is that, when- 
ever it is said that a given set of probabilities does refer to some phenomena, then 
it is understood that the relative frequencies of various aspects of the phenomena. 
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in a long series of trials, are approximately equal to corresponding probabilities. 
This is just what the author calls the classical point of view in the theory of 
probability. It is excellently explained by v. Mises (1939), but is more general than 
the definition of probability adopted by that author.* 

Apart from the classical point of view on probability, there is another. It 
considers the probabilities as measures of rational belief in the truth of a given 
proposition. Here the agreement between the probability and some relative 
frequency is not essential. 

The theory of confidence intervals was built up to give a solution of problems 
of estimation which would have a clear frequency interpretation, characteristic 
of the classical point of view. Consider a set of observable random variables, 

. . . , and assume as given that the function p{E\d-^,d^, represents its 

elementary probability law. Here represent certain parameters whose 

values are unknown. 

The above should be interpreted as follows. There are some actual trials T 
which are able to determine the values of the a;’s. There are also some numbers 

i9'i, dg, unknown to us, such that, whatever be a region w in the space of 

the aj’s, the integral of p(i/ 1 ■d'g, . . . , d-^) taken over this region is approximately 
equal to the relative frequency with which the point E, as determined by the 
trials T, falls within that region w. The problem of estimating one of the para- 
meters, e.g. ^ 1 , consists in using just one system of the sr’s as determined by the 
trials T to calculate ■d'-y approximately. Alternatively, it may consist in calculating 
an interval {a,a + d) which ‘presumably’ covers fi'i. 

The original approach to this problem is based on Bayes’s theorem. Denote 
by p{6y, 6,,, 0^) the elementary probability law of the d’%. Then 

p{dy,...,d,)p{E'\0y,...,0,) 

jp{dy, ...,0g)p{E' \ dy, ddy, ...,de^ 

will be the relative probability law, or the probability law a posteriori of aU the 
d’s given the observed system E' of the values of the x’b. It can be used to calculate 
the most probable value of 6y. Alternatively, given a number d > 0, the law can be 
used to- find the interval (a, a + d) such that the a posteriori probability 

P{a+d>6i>a \ E'} 

is greatest. 

Our attitude towards this kind of solution, dictated by the classical point of 
view on probability, depends on circumstances and may be twofold. 

The circumstances of the problem may imply not only that the a;’s but also that 
the 6's are random variables and that the function p{9i, ..., could be used to 

* It will be noticed that the classical point of view or probability does not imply any particular 
definition of that concept. It is not suggested that the one adopted by v. Mises is the only one 
that could be consistently used. 


p{6y, d,\E') = 


I- 
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calculate the relative frequencies of various combinations of values of the d’s. 
Such situations are rare, but they do occasionally occur, especially in problems of 
genetics and of mass production. If the function ...,6^) is implied by the 
problem considered, then the probability P{a -f d > > a | E') has a clear fre- 

quency interpretation, as follows. Imagine a long sequence, S, of oases where the 
d’s vary according to the above law and the a;’s are determined by the particular 
trials considered. Pick from this sequence S a subsequence 8{E') of such trials 
in which the experiments determined the same system of values of the x’s, namely, 
the system E\ Naturally, the value of 0^ in cases belonging to 8{E') would vary. 
But, if the functions I d^, 6^) a.nd p{6i, ...,d,) do have the presumed relation 

to the trials considered, it will be found that among all the intervals of length d, 
the interval (a, a + d) will contain the value of 0^ more frequently than any other, 
and that this frequency will be approximately equal to P{a -\-d>d^> a \ E'}. It 
follows that, if the function ...,ds) is implied by the circumstances of the 
problem of estimation, the use of the formula (1) is perfectly legitimate from the 
point of view of the classical theory of probability. 

The situation is quite different when the circumstances of the problem do not 
imply the a 'priori probability law. This is most frequently the ease. Moreover, 
usually there are serious difficulties in considering the d’s as random variables. 
Jeffreys (1939) advises the use of formula (1) also in such oases, with a function 
...,63) invented for the purpose. He claims that the conclusions drawn in 
this way are valid, provided that the function used is j ust the one that he suggests . 
The present author would not question this statement on condition that the word 
‘valid’, or any other such description, is not given any significance beyond that 
described above. In other words, there seems to be no reason why we should not 
agree to call the above conclusions ‘valid in the sense of Jeffreys’. On the other 
hand, it seems essential to be clear that any probability calculated from (1), with 
any function p{0i,..-,0a) not implied by the actual problem, need not and, 
generally, will not have any relation to relative frequencies. It will not be the 
probability in the classical sense of the word and, therefore, persons who would 
like to deal only with classical probabilities, having their counterparts in the 
really observable frequencies, are forced to look for a solution of the problem of 
estimation other than by means of the theorem of Bayes. 

This solution (Neyman, 1937, 19386) may be obtained as follows. Consider 
the case where the circurastaneea imply that the aj’s, forming a system E, are 
random variables with the probability lawp(fi/ j 6^,0^, ...,0„), where ■•■,0s 
are unknown. Denote by Q.{E) and 0{E) two functions of the x‘s. Obviously, if 
E is random then these functions will also be random variables. 

Djemnitioit 1. If the functions Q.{JE) and 0{E) possess the property that, 
whatever be the possible value of 0^ and whatever be the values of the unknown 
parameters 6^, 6^, the probability 

P{&{E) ^ ^ d{E) 1 ^ 1 , 6^,..., 0 s] s a, 


( 2 ) 
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then we will say that the functions d{E) and d{E) are the lower and the up'per 
confidence limits of 6^, corresponding to the confidence coefficient cc. The interval 
[S.(E), d(E)) is called the confidence interval for 

In spite of the complete simplicity of the above definition, certain persons have 
difficulties in following it. These difficulties seem to be due to what Karl Pearson 
(1938) used to call routine of thought. In the present case the routine was estab- 
lished by a century and a half of continuous work with Bayes’s theorem. It may 
be useful, therefore, to give a few illustrations. 

Assume that 5 = 2, that 6^ may have only the five values 1, 2, 3, 4, and 6, and 
that, at the same time, 6^ may vary continuously between zero and 1 . To satisfy 
Definition 1, the only requirement on the functions Q.{E) and d{E) is that 

P{d{E)^d-^d{E)\&,d^sa (3) 

for all values of d' = 1, 2, 3, 4, and 5, and for 02 var 5 dng between (0, 1). The 
probabilities (2) and (3) are, therefore, not the probabilities of 0^ falling within any 
limits. On the contrary, they are the probabilities of the functions Q.{E) and d{E) 
falling on both sides of a specified number d-. These probabilities are to be calcu- 
lated from the given functionp( J® | 2 ) with the value of 0^ set equal to the same 

number •&. The result must be totally independent of the values of 02, ... , 0^ and 
must equal a. 

It is known (Neyman, 19356; Feller, 1938) that in certain cases no such func- 
tions i(E) and d{E) exist. Then there are ways of modifying the formulation of 
the problem, for example, requiring that the probability on the left of (2) be at 
least equal to a, and so forth. In other cases, there will be an infinity of pairs of 
confidence limits all corresponding to the same a. In this case, the practical 
statistician is at liberty to choose among them. 

Let us now consider the frequency interpretation of the solution of the 
problem of estimation by means of confidence intervals. Suppose that some two 
functions d{E) < d(E) possess property (2) with some large value of a, say a = 0- 99. 
Their use in practice would consist of (i) observing the value E' of the x’s, (ii) 
calculating the corresponding values of the confidence limits Q.{E') and 5(F'), and 
(iii) stating that the true value d'l of 6^ lies between d{E') and The justifica- 
tion is simple and perfectly in line with the classical point of view of probability: 
in many applications, the relative frequency of cases in which the statement 
Q.{E) ^ < 0{E) is correct will be approximately equal to a = 0'99, whether or not 
the parameters for estimation are the same in all cases. 

The word ‘ stating ’ above is put in italics to emphasize that it is not suggested 
that we can ‘ conclude ’ that d{E') < •d'l ^ 5 {E'), nor that we should ‘ believe ’ that 
■S-xis actually between 0(JS) and5(il). In the author’s opinion, the word ‘ conclude’ 
has been wrongly used in that part of statistical literature deafing with what has 
been termed ‘ inductive reasoning ’. Moreover, the expression ‘ inductive reasoning ’ 
itself seems to involve a contradictory adjective. The word ‘ reasoning ’ generally 
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seems to denote the mental process leading to knowledge. As such, it can only be 
deductive. Therefore, the description ‘inductive’ seems to exclude both the 
‘ reasoning ’ and also its final step, the ‘ conclusion ’. If we wish to use the word 
‘ inductive ’ to describe the results of statistical inquiries, then we should apply it 
to- ‘behaviour’ and not to ‘reasoning’. The fact that a given pair of functions i(E) 
and d{E) satisfies the identity (2) may be ‘deduced’ from the properties of the 
function p(E\6^, Of). Earlier trials may show characteristics in the empirical 
distribution of the*’ s which seem in agreement with the function p(JS? ] ..., dj. 

On these grounds, after observing the values of the *’s in a case where the d’a are 
unknown and calculating S.{E') and we may decide to behave as if weactually 

knew that the true value ■S'l of dj were between fi(E') and d{E'). This is done as a 
result of our decision and has nothing to do with ‘ reasoning ’ or ‘ conclusion ’ . The 
reasoning ended when the functions d{E) and9(^?) were calculated, The above pro- 
cess is also devoid of any ‘ belief’ concerning the value ■d'l of d^. Occasionally we dp 
^lot behave in accordance with oiir beliefs. Such, for example, is the case when we 
takeout an accident insurance pohoy whileprepaiing for a vacation trip. In doing 
80 , we surely act against our firm belief that there will be no accident; otherwise, 
we would probably stay at home. This is an example of inductive behaviour. 

Obviously, if there are many different pairs of functions, i{E) and ^(E), all 
corresponding to the same a, our choice of the one to use must be based on the 
detailed study of their properties. For example, if it appears that the difference 
between one pair, di{E] — Q.i{E), is always (or most frequently) smaller than that 
between some other pa;ir, then we would probably prefer to use the first. The 
problem of determining the confidence limits and of studying their properties 
forms the subject of the theory of confidence intervals. 

3. NeOESSABY Alrt) STTEEICIENT CONDITIONS BOB A PAIE OF 
FUNCTIONS TO BE CONITDENOE LlSUTS 

Let a{E) < b{E) be any two single-valued functions of the cu’s determined for 
all possible systems of their values. Denote by W the space of the *’s and by 
one of the possible values of d-^^. Finally, let denote the region in the space 
W composed of all points E which satisfy the double inequality, 

a{E) ^ ^ b{E). (4) 

It was proved (Neyman, 1937) that for the two functions, a{E) and b{E), to be the 
lower and upper confidence limits forthe parameter it is necessary and sufficient 
that, whatever be the possible value of d^, the probability 

P{Fe:A(-&i)l<?i = i5>i)^a. (6) 

The identity refers to the arbitrary variation dg. 

This condition will be used below to show that a certain pair of functions does 
not represent the confidence limits. For this purpose, the following steps will be 
taken; We shall select a convenient value of the estimated parameter and 
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determine the region as in (4). Next, we shall substitute this same value 
instead of the parameter d-^ in the elementary probability law of the variables 
considered, getting ...,6g). This last function will be integrated over 

to find the probability P{EeA j as in the left-hand side of (6). But 

this integral will be dependent on the values of the other parameters involved, 
showing that the identity (5) is not satisfied. The conclusion will be that the 
particular functions considered are not confidence limits. 

4. Differences between the theory of ooNifiDENOB intervals 

AND THE THEORY OF FIDUCIAL ARODMBNT 

In this section we will consider examples treated both from the point of view 
of confidence intervals and of fiducial argument. These will be selected to illustrate 
both the conceptual and the numerical differences between the two theories. 

(i) Evidence of conceptual differences between the two theories. The first results 
obtained concerning confidence intervals {Neyman, 1934) refer to the case where 
all the n observable variables are mutually independent, normally distributed, 
have the same though unknown standard error cr, and expectations S‘(Xi) which 
are linearly connected with some s<n imknown parameters ...,pg, so that 

^{Xi) = • • • + O'isPs- (6) 

Here'the d’s are supposed to be known and to form a non-singular matrix. Denote 
by 6 any linear combination of the same^)’s, that is 

6 = -f- + ...+ b^pg, (7) 

with known 6’s not all equal to zero. In these circumstances, a confidence interval 
for d is given by F-St^^O^F + St^, (8) 

where F denotes the best unbiased estimate of 6 (David & Neyman, 1938), the 
estimate of the standard error of F, and t^ the value of the ‘Student ’-Fisher 
corresponding to the number of degrees of freedom n—s and to P = 1 — a. The 
appKoation of more recent theory (Neyman, 19366) shows that the confidence 
intervals (8) have distinct advantages over any others by satisfying the definition 
(Neyman, 1937) of the ‘ short unbiased system of type Without entering into 

these details, we shall consider the particular case where a = 1, = 1 and 

= 1. This will be the case if aU the a;’s come from the same unknown normal 
population and it is desired to estimate its mean, d = <^{xf). In that case F = x 

OT.fera.’ ,91 

* 7l(»-l) • 

Asmentioned, the general-confidence interval (8)was discussedin lecture about 
1930, and in 1932 a pubhcation appeared using the concept and the formula (8). 

As far as is known, the first full discussion of the corresponding result m the 
fiducial theory was given by Fisher a few years later (Fisher, 1935, 1936), and 
here is the relevant passage from the second paper. 
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If a sample of n otservations, se^, has been drawn from a normal population 

having a mean value /t, and if from the sample we calculate the two statistics x =Sxiln and 
s* = S(xi-xfl(n-l), ‘Student’ has shown {1926)’'’ that the quantity t, defined by the 
equation 


t = 


(x—ii).dn 


( 10 ) 


is distributed in different samples in a distribution dependent only from the size of the 
sample, n- It is possible, therefore, to calculate, for each value of n, what value of t will be 
exceeded with any assigned frequency, P, such as 1 % or 6 %. These values of t are, in fact, 
available in existing tables (Fisher, 1926-34). 

It must now be noticed that t is a continuous function of the tmknown parameter, the 
mean, together with observable values, x, s and n, only. Consequently the inequality 
is equivalent to the inequality 

/iKx—st^jdn, ( 11 ) 

so that this last inequality must be satisfied with the same probability as the first. This 
probability is Icnown for all values of h, and decreases continuously as is increased. Since, 
therefore, the right-hand side of the inequality takes, by varying h, all real values, we may 
state the probability that /i is less than any assigned value, or the probability that it lies 
between any assigned values, or, in short, its probability distribution, in the light of tho 
sample observed. 

It is of some importance to distinguish such probability statements about the value of 
/i, from those that would be derived by tho method of inverse probability, from any 
postulated knowledge of the distribution of in the different populations which might have 
been sampled. ... To distinguish it from any of the inverse probability distributions de- 
rivable from the same data it has been termed the fiducial probability distribution, and the 
probability statements which it embraces are termed statements of fiducial probability. 

In the next section we shall analyse the above passage in detail and show 
exactly where and how it conflicts with the classical theory of probability and 
thus with the theory of confidence intervals. Here we will mention only that it is 
amhiguous. Just this kind of ambiguity, which is also found in the earlier papers 
(Fisher, 1930, 1933), is probably responsible for a number of authors, including 
the present one, thinking that the fiducial theory and the theory of confidence 
intervals are lipked. 

In a few years it was found necessary to reinterpret formula (11). This was 
done by Fisher himself ( 1939 b) and, somewhat more clearly but on the same lines, 
by Yates (1939). It wiU be seen from the following quotation from Yates’s paper 
that the above passage by Fisher certainly does not contain everything which is 
now considered essential in the fiducial theory and that the presumption of any 
link between the latter and the theory of confidence intervals is unfounded. 
Yates’s more relevant sentences are italicized by the present author. 

While explaining the meaning of the fiducial distribution of the mean /t of a 
normal population, Yates mentions that the fiducial distribution of cr^ is given by 

‘ ( 12 ) 


(T' 


Six,. 

where has its usual distribution with n- 


■xf’ 

1 degrees of freedom. 


* Actually, of course, this result appeared earlier (‘Student’, 1908). 
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It can then be sbown tbat, for a value of pi equal to and a given s, the value of S in, 
subsequent samples would be as small as that observed in. a fraction e of the samples, 
provided that the actual distribution of cr^ is the same as the fiducial distribution given above. 

In this form, however, the statement is open to objection on the ground. that in subse- 
quent samples cr naay in fact be distributed in any manner, and that s will certainly vary 
from sample to sample. To avoid this objection we must frankly recognize that we have here 
introduced a new concept into our methods of inductive inference, which cannot be deduced by 
the rules of logic frorn already accepted methods. .. .That is... the form of fiducial statement 

which is implicit in the t test as ordinarily used by practical experimenters It must be 

recognized as essentially different from the statement that t will exceed in a fraction e 
of all experiments. The latter is true for any given fixed cr or any set of cr'a. The former 
(i.e. the fiducial statement, J.N.) is true for a given s when c is taken to beftducially distri. 
buted in the appropriate distribution.... Tha logical difference between the two approaches 
(fiducial and inverse probability, J.N.) should, however, he recognized. The approach by 
inverse probability enables fiducial statements about /i to be derived from the classical 
theory of probability, without the introduction of any new principle, hut only at the cost of 
postulating a particular a priori distribution of cr. In the fiducial approach such a priori 
postulation is regarded as inadmissible, but in order to discard it a new principle, that of 
utilizing the fiducial distribution of cr, must be introduced.. . . Once the principle is accepted it 
is possible, given x and s, to make formal and exact statements of the fiducial type about p 
which are independent of all prior knowledge of cr. If the principle is not accepted, then it 
appears that we must either assume an a priori distribution of cr, or deny that there is any 
possibility of making fiducial statements about p. 

The present author re unable to understand the exact meaning of what is 
called ‘fiducial statements about ju,'. However, bis conclusion is that their con- 
ceptual nature must be quite different from that dealt with in the theory of 
confidence intervals. This conclusion is based on the fact that all the difficulties 
described by Yates as inherent in the fiducial theory are non-exisfent in the theory 
of confidence intervals. Applications of the latter require no new principle ‘which 
cannot be deduced by the rules of logic no assumption that this or that unknown 
parameter follows any specified distribution, and have no connexion with Bayes’s 
theorem. To make the situation absolutely clear, imagine a sequence of normal 
populations ...,7r„, ..., with their means ...,6^^, and their stan- 
dard deviations cr^, cTj, ...,cr^, .... Imagine that out of each population 7r„, we 
have a random sample of % individuals, with its mean ^-nd an estimate of 
the corresponding variance as in (9). The theory of confidence intervals 
guarantees that the relative frequency with which fall short of the 

corresponding and, at the same tune will exceed this same number 

6.^, will be, within an error of sampling, equal to a. An incredulous reader may 
easily check this by a sampling experiment. In this he will be at liberty to keep 
6^ and/or cr^ constant, or to vary them at his, pleasure, without any restriction. 
Of course, the distributions of the populations sampled should be more or less 
normal and the sampling should be random. It follows from the above passages 
of Y ates that if the requirements above are satisfied and no new principles 
accepted, then we have to deny that there is any possibility of making fiducial 
statements about If so, then the nature of the latter is different from thpse 
involved in the application of the theory of confidence intervals.' 

Biometrika xxxii lo 
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The comparison of the above comments by Yates with those of Fisher gives 
a curious impression. Where Yates sees so many difficulties and restrictions, 
Fisher mentions none. Yet this very publication of Yates is fully endorsed by 
Fisher (19396). 

(ii) Numerical differences between the two theories. Besides establishing the 
existence of conceptual differences, it is essential to show that the two theories 
may give different numerical results. We may conclude from the discussion above 
that the application of confidence intervals requires fewer restrictions. But there 
is a logical possibility that, when both theories are applicable, they give the same 
numerical result. The following example shows that this is not the case and that 
fiducial limits need not satisfy the definition of confidence limits. 

The example that we are going to discuss refers to the problem of estimating 
the difference, say d, between the means of two populations of which it is known 
only that both are normal. Denote by 

®2,1> ^2,a> •••> J 

two random samples to be drawn from these populations and let n^n'. The 
confidence limits for d have been very elegantly obtained by Bartlett. He did not 
publish his results himself but they are briefly mentioned in a paper by Welch 
(1938). The tendency towards a greater generality of presentation resulted in 
certain complications. The foUowirig is a less general but simplified statement of 
the results.* Assume that the a:’s in (13) are numbered in the order in which they 
will be given by observation. Otherwise, randomize the second series. Next 
calculate n differences 

= *i.i-* 2 ,i {i = 1> 2, ...,n). (14) 

If ^(a:i_{) = 5 + ^ and = 6, then ^(%) = d. If the s.d.’s of the two 

populations sampled are tr and cr ', then the s.b. of will be ( 0 "® + cr'^)!. The con- 
secutive u’s wiU be normal and independent and the problem of estimating the 
difference between the means of two normal populations will be reduced to that 
of estimating the mean of one population of the u’a. Its solution is given by the 
confidence interval 

u — 8%^), ( 15 ) 

where 8 has an obvious meaning and is to be taken with to — 1 degrees of 
freedom. 

Again, an experiment consisting in repeated sampling of pairs of normal 
populations will show that, whatever be d, 8, cr, cr', whether constant or varying in 
an absolutely arbitrary manner, the relative frequency of cases in which the 
statement about 8 in the fqrm of ( 1 5) will be true wfll be approximately equal to a . 
The above solution of the problem, elegant as it is, is only a partial one. The results 

* Apart from these, the same author has obtained certain relevant results referring to the case 
where » = ■»' = 2 (Bartlett, 1936). 
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of Bartlett do not tell us whether the family of systems of confidence intervals 
found by him exhausts all the possibilities and whether it is possible to construct 
intervals which would be, in one sense or another, shorter than those given by 
(15). These are interesting and important problems and we may hope to have 
them solved. 

A result in fiducial theory corresponding to, but not equivalent with, formula 
(15) has been published by Fisher (1936); 


Let us suppose that a sample of n observations has yielded a moan, x, and an estimated 
variance of the mean, s®, so that = S(Xi—xYln(n— 1) ; then we know that if /« Ls the mean 
of the population 

/i — x+at, (16) 

where t is distributed in ‘Student’s’ distribution. Similarly, for the mean of a second 
population, of which we have n' observations, we may write 

p,' = x'+8V, (17) 


where t' is distributed in ‘Student’s’ distribution with n' — \ degrees of freedom, inde 

pendently of «. If now « . , 

^ = x'-x = d, 


we find that 


i—d = 


(18) 

(19) 


and since, s' and a are known, the quantity represented on the right has a Icnown distribu- 
tion, though not one which has been fully tabulated. The equation may he written 


e = V(s'‘ 4- s'*) {t' cos J2 — i sin JB), (20) 

where tan J? = s/a', so that 2? is a known angle. If t and t' be taken as the co-ordinates of a 
point on a plane, the frequency of the observations falling within any area of the plane is 
oaloulablo. The points for which e has any given value lie on a straight line, at a distance 
from the origin ± e/(s^-f s'*)*, and making an angle It with the axis of t. The fiducial prob- 
ability that e exceeds any given value is the frequency in the area above this line. If n 
and n' are both increased, the distribution of e tends to be normal and independent of ZJ ; 
when R is O'" or 90“ the distribution is of ‘ Student’s ’ form. In general it involves n, n', and 
R and for any chosen probability, therefore, requires a table of triple entry. 

As the reader will notice, no restrictions are mentioned and it is not suggested 
that for the practical application of the results any assumption is needed con- 
cerning the variability of the variances of the populations sampled. Neither is 
there any suggestion of any new principle that may he involved. We will return 
to this point below. 

Following the publication of Fisher just quoted, and on his advice, Sukhatme 
published a table (Sukhatme, 1938). The quantity tabled may be denoted by 
f{n,n',E) and represents the root of the equation 


/•-foof (• + «> 


H{t')dt'\dt = 0-026, 


(21) 


where G{t) a,ndH{t') are ‘ Student’s’ distributions with n— 1 and n' — 1 degrees of 
freedom respectively, while 

( 22 ) 


K = 


(s® -fa'®)* COS B 


10-2 
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It follows from the context that/(«, iJ) so calculated is the value such that 
the fiducial probabil% of its being exceeded by \6\j{s^ + s'^)^ is equal to 0-06. 
In other words, the values /(w, n', B) are the fiducial 5% limits of | e\l{s^ + s'^)K 
As e = 5-d, if the presumption that the fiducial limits necessarily lead to 
confidence intervals be true then this means that the double inequality 

x’~x-J(n, n\ B) 4S^x' ~x+fin,n' , B) yl(s^ + s'^) (23) 

must be the confidence intervals for d = /*' — /i. But it is easy to see that the 
functions on the extreme parts of (23) do not satisfy the conditions, explained in 
§ 3 above, necessary and sufficient for them to be the confidence limits. Take 
5 = 0 and denote simply by A the region in the space of the *’s including all the 
points in which the inequality (23) is satisfied. Take the probability law of the 
a’s and put 5 = 0 in it, that is, y' — y. It will be seen that the integral I{A)oi this 
probability law taken over A depends on the ratio p = tr/tr' of the two tr’s appro- 
priate to the two populations sampled and, thus, that it does not satisfy the 
identity (S). 

Condition (23) defining the region A does not involve the particular k’s but 
only the means x, x', and the variances and s'®. Consequently, to calculate 
I{A) we may start with the probability law of those four variables 


p{x,x',a,3') 


■n/r'n' 


<T^<T 




n(x — /i)^ n{n-\)s^ 


1 

X expj^ ^ 


,'2-| 

- , (24) 


where c is a purely numerical constant and does not involve any of the parameters. 
This function must be integrated over the region A defined by (23) or by the 
equivalent inequality 

1 ‘7*^ — jy? I 

(26) 




In dealing with it, we have to remember that B is not a constant but is connected 
with.« and s' jDy the equation tan JS = sis'. The required integral, or probability, 
of*, s, anda' satisfying (26) wdll be more easily calculated if we introduce a new 
system of variables, m, v, JR, and Sq. These will be connected to the old system as 
follows: 

X = pt + us^sinB,' 
x' = p + ?;s,)COS B, 
a = Sfl sin B, 
s' = SeCosU. 

The J acobian J of the transformation is easily found to be 

J = ^siniJcosiJ. 


(26) 


k 


(27) 
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The limits of variation of the new variables are as follows: 


— CO<U,V< +00, 

0 ^8o, 

0 < jB < 

The probability law of the new variables will be 


p(u, V, Sq, i?) = ——i-, S q+^'~^ sin ’^-^ R cos iJ, 

(X^O’ " 

... ,, nu'^&m‘‘R n'v^cos^R n(n—l)Bin^R n' In' — l)cos^ R 

with = s~“+ ?» — - + — V + — ^^ A . 

cr® o-'2 cr^ cr'2 

Inequality (26) will be equivalent to 

I D cos i? — tt sin ^ I <f(n, n\R). 


(28) 


(29) 

(30) 

(31) 


As this does not involve the integration with respect to this variable can be 
carried out within the extreme limits of its variation. As a result further integra- 
tions may be performed on the probability law of u,'v, R, 


/*oa 

p{u, v, jR) = 1 ^ p(u, V, Sq, R) ds„ 


c sin'‘~^ R cos”''“^ R 


ntr'n' 


(T"(r 


^n+n’ 


(32) 


where c is again a numerical constant. 

Further integration may be conveniently carried out as follows. Substitute 
a new variable z for the variable v so that 


V = 


z + u sin R dv 


cos J? 


dz cos R ' 


(33) 


Keep z constant within the limits [ z | ^J{n, n', R) prescribed by (31) and integrate 
for u from - oo to + oo. The result is 


-S) 


c sin’*'"® R cos'‘'~^ R 

Q-n~lg.'n,'-l + wV^) 


nn,' „ ?i(n— 1) . Ti'(n' — 1) » r,) 

z^ + - - o - sin^ R + o— - CQS^ .fi . (34) 


nc ^ + n (T 




The integration is completed by an easy substitution for z 


1(A) = 

'inf 




gij)^7l-2 003’'''“® R 


{n{n — 1 ) sin^ R+n\n' — l)p^ cos^ iJ)4(»+™'-2) J ^ (i ^2 


dz 

■'-2) Jo (l + z2)«"+«-'-i)r"’ 


(36) 
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with f — f{n, n', B) and 




wcr® + K,cr7\ cr^ cr® / 


( 36 ) 


By inspecting (36) it is more or less evident that I{A) must depend on the value 
of p. However, to avoid any doubt in this respect, it was thought useful to 
calculate I {A) for a few values of p. This was done by Miss Elizabeth Scott of the 
Statistical Laboratory, University of California, and it is a pleasure to record the 
author’s indebtedness to her. The calculations involved supplementing the tables 
of Sukhatme for a denser set of values of B. The calculated values of I{A) are; 


71 = 12, n' — C 


p 

1 ( A ) 

0-1 

0-066 

1-0 

0-960 

lO'O 

0-934 


Thus the functions representing the fiducial limits for S do not satisfy the condi- 
tions necessary and sufficient for them to be the confidence limits of the parameter 
in question. It follows that if pairs of normal populations forming a long sequence 
are sampled and the extreme parts of the double inequality (23) calculated, then 
the relative frequency of cases where the prediction of the value of 8 by means 
of these inequalities will be correct need not be equal to the expected 0- 96. It will 
depend on the value of p and, if this is uncertain, this frequency will be unknown. 
Subsequent comments by Fisher (Fisher, 1939 a) seem to indicate that the fre- 
quency in question is expected to approach 0-95 only if the ratio p is not constant 
but follows a certain fiducial distribution. It is noteworthy that no such restric- 
tion is to be found in the original work quoted above. On the other hand, it is 
more or less in line with those restrictions formulated by Yates. 


5. Views oe M. S. Babtlbtt and R. A. Fishee 

The controversy in which the main contributors are Bartlett (Bartlett, 1936, 
1939) and Fisher (Fisher, 1937, 1939a, 19396) seems to be based on a mis- 
understanding. Presuming that the fiducial limits are always equal to confidence 
limits, Bartlett was puzzled by Fisher’s results concerning 8 just quoted, and 
suspected an error. The subsequent elaborations by Fisher and Yates amount to 
a confirmation that the values of/ (», n' , R) as tabled by Sukhatme do not provide 
the confidence intervals. But both authors are emphatic that there is no error in 
the original deductions, and that Bartlett misunderstood the problem. It is 
unthinkable that these four unanimous papers are mistaken and, therefore, we 
must accept the conclusion that the presumption of intrinsic identity between 
fiducial and confidence limits is unfounded. 

But it must be pointed out that, before the appeal to extra-logical principles 
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was published, there was much to be said in favour of the opinion that the solution 
of Fisher, as quoted above, and the work of Sukhatme both involved errors in the 
algebra of probability laws. It also seems that, apart from establishing that the 
fiducial theory and the theory of confidence intervals are distinct, it will be of 
some interest to analyse Fisher’s work in detail and to point out exactly where 
and how it diverges from the rules of ordinary theory of probability on which the 
theory of confidence intervals is based. 

When a system of observable phenomena is treated mathematically, it is 
essential to be clear on exactly what is assumed as given or as known. For example, 
when trying to calculate the area of land from a certain set of measurements, it is 
essential to be clear as to assumptions made concerning the shape of the land 
considered. The available data may be consistent with a number of such assump- 
tions, e.g. that the surface considered is a plane or that it is spherical with a given 
radius, etc. Whichever of these hypotheses is accepted as given, the applications 
of the appropriate formulae will give mutually consistent resxilts. But they would, 
not generally be consistent if one part of the calculations were made on one 
hypothesis and another on a contradictory one. The differences may be small, but 
in mathematics there are really no ‘ small ’ nor ‘ large ’ inconsistencies. There are 
simply inconsistencies. Needless to say, the choice of exactly what is to be 
accepted as given must be made to attain the greatest conformity with empirical 
facts. But this is a question which need not be discussed here. 

The above general principle also applies to the applications of probability. 
There we must be clear as to exactly what are the phenomena or the variables 
which we agree to consider as random in a given inquiry. In practice, of course, 
the random variable will be the one whose value at the moment is uncertain and 
is being determined ‘by chance’. If X is considered as a random variable, the 
premises of the mathematical problem must include some assumptions as to the 
relative frequencies with which X assumes its possible values. These assumptions 
may vary in specificity, but they must be present in the premises. 

Any number or variable which is not random must be clearly recognized as 
such. For some time such non-random numbers were called constants. This was 
more or less satisfactory with constant numbers. But Fr6chet (Fr6chet, 1937) has 
noticed that we may also consider variables which are not random and has in- 
vented useful terms to describe them. These are ‘nombre certain’, ‘fonction 
certaine ’ , etc. We will translate these terms by ‘ sure number ’ and ‘ sure function ’ . 
The thousandth digit in the expansion 7r = 3-1415...isa sure number, although 
totally unknown to me. Denote by/(n) the relative frequency of O’s among the 
first n digits of the same expansion of tt. This will be a sure function. On the other 
hand, if (j){n) denotes the number of errors that may be made when calculating tt 
to n places of decimals, then ^{n) may be considered as a random function of n. 
Considerations of this kind would imply those of a considerable sequence S of 
similar attempts to calculate ir, by the same person or by different persons of a 
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specified category, in which the values of f>{n) will vary, as we shall say, at random . 
It is with respect to just such a sequence of determinations of the values of the 
function <j){n) that our probability statements will refer. For example, if we either 
start or finish our calculations with the probability equal to 0-25 of g5(n) being 
between any two sure numbers a, b, then the applicational statement is that about 
25 % of the numbers of the sequence S satisfy the inequality a <^{n)<b. 

It is important to notice that the sequence S may consist of just one member; 
then all the proportions relating this ‘sequence’ will have to be either 0 or 1. In 
other words, if the sequence of ‘random’ determinations consists of just one 
element, this element will have the property of a sure, not a random, object, in the 
usual sense of the word. 

Now let us turn to the passage from Fisher’s paper quoted above, p. 136, and 
try to see exactly what is supposed to be random there and what elements of the 
problem are treated as sure numbers or sure functions. These details in the set-up 
are not stated at the outset, but there is no difficulty in collecting them from 
appropriate passages in the paper. We first see that the function t of (10) is sup- 
posed to be ‘distributed in different samples...’. This means that i is a random 
variable and that its randomness depends on what is found in those repeated 
samples, namely, the values of $ and s. It follows that the probabilities concerning 
X, s, and t refer to the sequence S of those ‘different ’ samples. The sequence could 
not consist of just one sample because, in such a case, the ‘ distribution’ of t would 
not be anything like ‘Student’s’ law. The references to a normal population 
sampled and to ‘ Student’s ’ law indicate, on the contrary, that the sequence S 
of samples is very large indeed, and that the distributions in it are comparable to 
those represented by continuous curves. 

Up to this time we have, not mentioned the population mean y which is also 
involved in the expression of {. Obviously, this may be treated mathematically 
either as a random or as a sure number. Both methods of approach are at our 
disposal but, in order to avoid inconsistencies, we must be clear as to which one 
we follow. The indication of Fisher’s choice is found a little further on in this 
article, in the place describing the distinction between the fiducial and the inverse 
probability approach: ‘It is of some importance to distinguish such (fiducial) 
probability statement about the value of y, from those that would be derived by 
the method of inverse probability from any postulated knowledge of the distribu- 
tion of (i in the different populations which might have been sampled.’ This 
sentence does not seem to leave any ground for doubt. In the fiducial approach 
we consider but one population sampled and no distribution of is postulated. 
Therefore, /t is a sure number and, if t is distributed according to ‘ Student’s ’ law, 
it is a result of the appropriate variability of x and a alone. 

The symbol which also comes into play, is obviously a sure variable capable 
of any real value between — oo and -f go. We may select it as we wish and then 
obtain the probability P(h) of the random variable t exceeding from tables. 
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Following the article , we will readily agree with Fisher that the inequality (11), 
namely, fi<x — stij^n, is equivalent to £ > h and that it must he satisfied with some 
probability P(£i). Now consider the phrase; ‘Since, therefore, the right-hand side 
of the inequality (i.e. x — stj^n) takes, by varying ah real values, we may state 
the probability that fi is less than any assigned value, or the probability that it lies 
between any, assigned values, or, in short, its probability distribution in the light 
of the sample observed.’ From the point of view of ordinary logic and of ordinary 
theory of probability this phrase is inconsistent with the original set-up. The first 
inconsistency is involved in the words which are italicized, suggesting that x and 
s in the expression x — st-j^l^fn are not random but sure numbers, referring to one 
particular observed sample. As a matter of fact this same inconsistency appears 
earlier in the statement that 5£j/„yn, by varying h, will run through all real 
numbers. If, as former^, x and s are random with their variation appropriate to 
the sequence S, then, whatever value we choose to ascribe to say £ = 2, the 
expression x - 2s j^Jn is also random and depends on the outcome of sampling. 

Apart from this sudden shift in the meaning ascribed to x and s, there are two 
more inconsistencies. To see the first of them, let us follow Fisher, changing our 
minds about x and s and considering them as sure numbers, determined by one 
particular sample. In this case the inequality ij,<x — stJ^n would contain no 
random elements at all: the first element, p, is an unknown constant, the mean of 
a single population sampled, x and s are fixed by the sample observed, and t^ 
is the value of the sure variable that we have chosen to consider. In these oiroum- 
stanoes, the inequality may either be true or not true and the probability of its 
being true will equal unity or zero and have nothing to do with the probability 
or frequency P(£i) which this same inequality satisfies within a sequence S of many 
‘different’ samples. 

The last inconsistency refers, of course, to the point of view on /t. As we have 
seen above, it is first considered as a sure number, but the passage just quoted 
speaks of the probability of its lying between any assigned limits possible to 
determine from the values of ■P(t). Assume n = i and that the sample observed 
gives a; = 10 and 5 = 2. Select t^ = 0-765 and £i = — 0-765 so that F(t^) == 0-25 and 
P(i() = 0-75. This would result in the supposed probability P' of lying between 
the limits 9-236 10-765, being equal to 1/2, Trying to interpret this result in 

the light of the classical theory of probability, we have to conceive a sequence, 
say S', of cases in 50 % of which n falls between the above limits. But exactly what 
could this sequence be? Either there is such a sequence and then we must also 
consider other populations ‘which might have been sampled’, and postulate some- 
thing about the distribution of p*, or else the ‘sequence’ must be the degenerate one 
of one element only with the probability P' equal to either zero or unity, hut 
never to 1/2. 

These are the points previously mentioned by the author (Neyman, 1934), 
* This is quite essential. Otherwise there would be an error in Bayes’s theorem. 
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which, from the point of view of classical probability, represent conceptual in- 
consistencies. They are also present in the other passage of Msher quoted on 
p. 139, but a similar analysis of that passage supplemented by what has subse- 
quently been done by Sukhatme, will reveal errors in algebra of probability laws 
as well. These errors are particularly relevant from the point of view of the contro- 
versies between Bartlett and Eisher. 

The quantities considered in this passage are all dependent on the population 
means /< and f/,' and on the statistics x and a of one random sample and on x' and 
a' of the other. Our analysis will also require the consideration of the population 
variances n® and (t'®. We must start by deciding on the random or sure character 
of all these quantities. Fisher’s remark that the two ratios 


t 


^ and 


(37) 


are distributed according to ‘ Student’s ’ law with appropriate degrees of freedom 
suggests that y and y' are treated as sure numbers and that x, x', a, and a' are 
random. There is no reference whatever to the variances cr® and cr'®. As nothing 
is disclosed about what distribution they may possess,, by analogy with the y's 
it is natural to treat them as sure numbers also. 

In order to interpret every step in calculations more easily, we shall imagine 
two normal populations Tti and rr^ sampled and a sequence A. of pairs of samples, 
of n and n- individuals respectively, drawn independently from and n^. These 
pairs of samples will detebmine x, a, and generating distributions appro- 
priate to normal populations. Substituted into formulae (37) they will make t and 
t' Vary to generate the two distributions of ' Student’ . 

With this iu mind, let us examine the passage in which Fisher writes 

e = d~d = a't' — at, (38) 

and comments; ‘ Since a' and s are known, the quantity represented on the right 
has a known distribution, though not one which has been fuUy tabulated,’ We 
see here the same kind of sudden jump hi the point of view on quantities con- 
sidered as is found in the passage analysed previously. Formerly s' and a were not 
‘ known ’ but random. Otherwise, the distribntions of t and t' would not have been 
those of ‘ Student ’ but would have been normal about zero and due solely to the 
variability of $ and x'. Now s' and s are known sure numbers. Let us allow for this 
shift in conditions and try to visualize the character of the distribution of e for 
fixeds' ands. For this purpose we have to consider not the whole sequence ^4 of pairs 
of samples mentioned above, hut only a subsequence B composed only of those 
pairs of samples in which the estimated variances have the same values a and s' as 
the ones supposed to be ' known ’. The variability of e in the subsequence B will be 
the result of the variability of x and *' only. It is known, that the mean of a sample 
from a normal population is independent of the sample variance. Consequently 
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the distributions of x and x' in B wOl be normal. As the connexion between e on 
one hand and x and x' on the other is linear with constant coefficients, it would 
follow that the distribution of e in B would be normal also. Therefore, it is with 
some surprise that one reads Fisher’s suggestion that this distribution has not 
been fully tabulated. Evidently, when writing the sentence quoted, Fisher had 
something else in mind, probably depending on the new extra-logical principle 
described in subsequent publications. However this may be, we have to note the 
conflict between the sentence quoted and the rules of ordinary logic and of the 
classical theory of probability. 

The distribution of e by itself does not play any further role in Fisher’s work. 
Instead he and, subsequently, Sukhatme consider the ratio that we will denote by 
z — s'®). Fisher does not write any formula representing the supposed 

distribution of z and we have to look for the details of his ideas m Sukhatme’s 
paper. Complimentary references to this paper in subsequent publications by 
Fisher suggest that it is perfectly in Une with his own ideas. We quote the relevant 
sentence in Sukhatme’s paper, only altering his notation to bring it into agreement 
with that of Fisher, 


He (Fisher) considers the distribution of 


z 




COS B — t sin B, 


(39) 


for given n, n', and R in order to obtain the probability that z exceeds any given 
value. 


It is obvious at once that the probability in question does not refer to either of 
the sequences A or J5 visualized above. The appropriate sequence 0 of parrs of 
samples to which this probability refers is a part of the sequence A composed of 
aU such pairs of samples in which the variances and s'®, while variable, keep the 
ratio 8 1 s' = tan B = constant. Mathematically, the distribution sought is known 
as the relative distribution law of zgiven B and is denoted by p(z ] B). If p{B) and 
p{z, B) are the absolute probability law of B and the absolute joint probability 
law of z and B, respectively, then, for every B such that p(F) > 0, 


p(z 1 B) = 


p(B) ■ 


(40) 


The relative probability, given B, of z exceeding a fixed number z^, that is 
P(z > I i2), wifi be obtained by integrating (40) for z from to + oo. There is an 
alternative way of obtaining the same probability. This consists of first finding the 
relative joint probability law given Boft and t'. If this is denoted by p{t, t' | B) 
then 

P{z > Zj, 1 E} = f {* p{t, t' I B) it At', 

J J «J(zi) 


( 41 ) 
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wheie the region of integration w{Zi) is determined by the inequality 

z = t' C 06 jR'-tamB>Zi. (42) 


A familiar formula gives 


I B) 


p{R) 


(43) 


Whichever way, (40) or (43), is preferred, the resulting probability P{z > 2i ( ^} 
will have the sanae value and will refer to the sequence C described above. 

Sukhatme has chosen to apply a quadrature procedure to calculate the integral 
(41) with the integrand equal to the product of t wo of ‘ Student’ s ’ distributions with 
n - 1 and n'-l degrees of freedom respectively. This is just the error in algebra of 
probability laws mentioned above. The t and 1' are distributed independently and 
in accordance with ‘ Student’s ’ laws only in the sequence A where both the means 
E and x' and also the variances and are undisturbed in their random and 
independent variation appropriate to samples from normal populations. When 
calculating the probability ‘for a given R’, we do not consider the sequence A 
but only its part C so selected that the ratio sjs' is constant. This selection disturbs 
the original distribution of a and s' and is reflected in the resulting joint distribu- 
tion of ^ and 

In our calculations above (26) we have used the letters u and v for what is here 
denoted by f and t'. Consequently, the joint probability law p{t, t', R) is obtained 
from (32) by merely substituting t for u and (' for v. The absolute probability law 
of R ia easily obtained by integrating (34) with respect to z between the limits 
-00 and +oo. The result is 


P sin"~^ R co8”'~* R 


(44) 


with c denoting a numerical constant. Substituting (32) and (44) into (43) we 
obtain 


p(tJ'\R) 


mp) 

{»(<*-)- W— l)sin*iJ-t-»'(J'®-t-»' — 1) /)* COS^ Jiji(n+n')’ 


(45) 


with f>{R, p) denoting a function of R, p, n and n' only. p{t, <' [ JR) is just the func- 
tion to be integrated to obtain the relative probability given Boft and t' to verify 
any inequality such as «' cos iZ - i sin ^ > 2 i. As one would expect p(t,t’\R) 
appears to depend not only on B but also on the ratio of the population vari- 
ances p®. 

It follows that, from the point of view of the ordinary theory of probability, 
the Fisher-Sukhatme solution ia wrong. The error consists in their confusing the 
absolute probability law of t and obtainable by integrating (32) for R, with the 
relative probability law given R of the same variables as given by (46), Some such 
error seems to have been suspected by Bartlett. Repeated denials and the re- 
ference to the extra-logical principle underlying the fiducial theory lead us to 
believe that from the point of view of that particular theory the error is non- 
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existent. While accepting these explanations we may still regret that the earlier 
papers by Fisher and that of Sukhatme do not contain any clue as to how they 
are to be interpreted. 


6; StrMMARy 

1 . The theories of fiducial argument and of confidence intervals differ in their 
basic conceptions. The validity of the former requires, at least in some oases, the 
fulfilment of various restrictions of which the theory of confidence intervals is 
totally free, and/or the acceptance of some new principles impossible to deduce by 
the rules of ordinary logic (Yates, 1939; Fisher, 19396). 

2. The two theories may occasionally give the same numerical results in the 
form of fiducial limits on one side and of confidence limits on the other. The pro- 
blem of estimating the difference of means of two unknown normal populations 
shows, however, that this need not always be the case and that fiducial limits need 
not satisfy the definition of confidence limits. 

3. Bartlett’s criticisms of Fisher’s solution of the problem just mentioned 
seem to be due to his considering the problem from fhe point of view of ordinary 
theory of probability and ordinary logic. In this light Fisher’s solution does 
contain both conceptual misunderstandings (originally pointed out in the author’s 
paper of 1 934) inherent in the very concept of fiducial distribution of a parameter, 
and errors in algebra of probability laws. Since the first references to the new 
principles outside of ordinary logic, which supposedly justify the fiducial theory, 
were published after the publication of Bartlett’s criticisms, the latter seem to be 
perfectly justified and useful. 

4. Owing to a certain flaw in the ideas underlying the fiducial theory which is 
noticeable in passages quoted in § 4, it is impossible to insist on any definite 
attitude towards it, except that of doubt. It may be useful, however, to express 
the following conjectures which seem to be very probable. If they are wrong then 
they will be put right and, as a result, the situation will be clarified. 

The present author is inclined to think that the literature on the theory of 
fiducial argument was born out of ideas similar to those underlying the theory of 
confidence intervals. These ideas, however, seem to have been too vague to 
crystallize into a mathematical theory. Instead they resulted in misconceptions 
of ‘ fiducial probability ’ and ‘ fiducial distribution of a parameter ’ which seem to 
involve intrinsic inconsistencies as described in § 5. In this light, the theory of 
fiducial inference is simply non-existent in the same sense as, for example, a theory 
of numbers defined by mutually contradictory definitions. 

In earlier stages when- the problems treated were very simple, the fallacy 
involved in ‘fiducial probability’ was not apparent. Later on, however, diffi- 
culties appeared and the new principle ‘which cannot be deduced by logic’ seems 
to have been invented to disentangle them in one particular case. But the word, 
‘principle’ implies some. generality, hence the drift in comments on the samh 
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subjects treated in 1936 and again in 1939. From the point of view of the direction 
of this drift it is perhaps significant that Yates speaks of ‘fiducial statements 
possible to make on the ground of probabilities a posteriori and that the paper by 
Jeffreys which professes the equivalence of fiducial theory with that of inverse 
probability appeared in the Annals of Eugenics^ edited by R. A. Fisher. 

However this may be, the only thing that the present author ventures to 
profess is that the theory of fiducial probability is distinct from that of confidence 
intervals. 
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The Incomplete Beta Function ratio has been defined as 


4(P.9') = •fix(P.2)/^(P.3) 


J'(p + g) 


( 1 ) 


When the fundamental Tables of the Incomplete Beta-function (C) were pub- 
lished by Karl Pearson in 1934 it was realized that they might form a basis for 
shorter tables suited for use in special problems. One such application is in con- 
nexion with sampling theory and the associated significance tests. In carrying 
out these tests it is generally considered that a table giving values of the argument 
corresponding to certain convenient probability levels is more useful than one 
in which the probability integral is listed at equal intervals of the argument. 
Using the transformed variable 


2 = ilog, 


qx ’ 


( 2 ) 


R. A. Fisher (3) was the first to provide tables of this character, giving values of 
z associated with the 0’06 and 0*01 probability levels. Since then, a table for the 
0-001 level has been calculated by Oolcord & I)eming(2) and one for the 0-20 level 
by H. W. Norton (in Tables edited by Fisher & Yates (4)). In the terminology of 
the analysis of variance, if 

is a sum of squares depending on degrees of freedom, and 
8^ is a sum of squares depending on degrees of freedom, 

then 

and I'l = 2q, Va = 2p. (4) 
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For tests of significance, z is easily computed and the fact that, when v-^ and 
are large, it tends to be normally distributed about zero with a standard 
deviation m/i 

( 3 ) 




lent considerable weight to its tabulation rather than that of x. Experience has, 
however, shown that in a number of problems it is the percentage levels of x of 
the Beta-distribution that are directly required; this fact and the desirability of 
having available a greater number of percentage levels* are reasons for the issue 
of the present tables giving five significant figures for x. They may be regarded 
as a supplement to the original 1934: Tables 

Conversion from x to or to 8nedecor’s(7) F, where 


Vj/Sg qx ’ 


( 6 ) 


is straightforward. Tables of the seven percentage levels for F have, in fact, 
been already computed, and it is hoped to, include them in a new edition of 
Tables for Statisticians and Biometricians. 

Since the completion of the marginal columns for the tables of F involved some 
fresh computation, it seemed useful to extend the work so as to provide new 
tables giving thirteen percentage levels for x^- These tables are printed in a 
separate contribution on pp. 187-191 below; they have been calculated to six 
significant figures and cover the range of degrees of freedom v = 1(1)30 and 
40(10)100. 

A word of comment is perhaps desirable as to the introduction of the notation 
V, and for degrees of freedom in place of the customary n, n^^ and n^. The use 
of the letter n, both with and without a subscript, to denote a gvon'p frequency has 
beeiv so long estabhshed in publications associated with Biometrika and elsewhere 
that it seemed desirable in these tables to avoid confusion by adopting a fresh 
symbol for degrees of freedom . The letter / has sometimes been used, but the 
notation is not altogether satisfactory; the letter v is that eihployed by Yule & 
Kendall ((8), p. 415), and its use here should be free from any ambiguity. 

Reference has been made above to the existence of problems where the direct 
requirement is for the percentage levels of x rather than zov F. A ease in point is 
that of the multiple correlation coefficient in samples from uncorrelated normally 
distributed material; here follows exactly the Beta distribution. In other 
cases the distribution may be used to give an approximate fit to probability 
functions whose exact equations are either unknown or difficult to handle. Thus 
in his Preface to the Tables of the Incomplete Beta-function, Karl Pearson stated 
that his first interest in the function was stimulated by the discovery of how 
accurately it could be made to graduate a hypergeometric distribution. The 

* The percentage levels tabulated are: 60, 26, 10, 6, 2'6, 1 and 0-5. 
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fitting was carried out by equating the first four moments of the Beta and hyper- 
geometric distributions. 

Again, if a random variable can assume only values between 0 and 1, if it has 
a mean value of and a second moment about zero of then the probability law 

where P = (8) 

will often give a very close approximation to the true law. Use has been made of 
this fact by Neyman & Pear8on(5), Bishop(i) and others in determining prob- 
ability levels for the likelihood ratio criterion used in testing the homogeneity 
of a series of variances and covariances. The accompanying tables are directly 
applicable in such problems. 

Miss Catherine M. Thompson (now Mrs V. G. Grylls) has been responsible for 
by far the greater part of the numerical work involved in the production, and the 
tables should rightly be associated with her name. Owing to the special character 
of the Beta-distribution, which makes it necessary to vary the method of com- 
putation in different parts of the range of variables covered, a considerable 
amount of exploratory work and some careful planning was needed in the 
development of the lines of attack. This essential aid has been provided by 
Drs L. J. Comrie and H. 0. Hartley of Scientific Computing Service Ltd., in. 
whose office Mias Thompson carried out. most of the work. Since the evacuation 
of University College at the beginning of the war this help, both in advice and in 
accommodation, has been more than ever essential. In the following pages 
Urs Comrie and Hartley have described the various methods used in computation 
and have also discussed the problem of interpolation. 

The Editor is glad to take this opportunity of expressing his warm appreoia- ' 
tion of this collaboration, which has made it possible to carry through to a suo- 
oessful conclusion a piece of work that bad long been in view. 
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Pe/rc&fhtage points of the iTicomplete hetci-function 

DESCRIPTION OF THE CALCULATION 
By L. J. COMRIE and H. 0. HARTLEY 


iNTRODirOTIOir 

Ikt terms of Karl Pearson’s notation the incomplete Beta-function Ji^{p, q) is 
defined by the integral 

~j x'»-\l-x)^-^dx. ( 1 ) 

For a: = 1 we have the complete Beta-function B^{p, q), commonly denoted by 
B(p, q) and defined by the equation 


B{p,q) 


np)m 


( 2 ) 


r^p + q) ’ 

which is identical with (1) for a; = 1. 

The tables give the percentage points of the ‘normalized’ incomplete Beta- 
function no- 4 -oi f* 


( 3 ) 


They are defined as the roots x of the equation 

Up,q)^P ( 4 ) 

for given P, as functions of the parameters p and q. Seven tables have been pre- 
pared corresponding to seven selected values of P, namely 0-005, 0-01, 0-025, 
0-06, O-IO, 0-26 and 0-60. From, the formula 


1— (S) 

the roots of (4) follow immediately for P = 0-75, 0-90, 0-95, 0-975, 0-99 and 
0-095. 

In each table x is tabulated for 


= 2g = 1(1)10, 12, 16, 20, 24, 30, 40, 60, 120 and oo 
Vj = 2p = 1(1)30, 40, 60, 120 and oo 

With 2q as column heading and 2p as row headings the arrangement of the 
tables corresponds to that of the upper percentage points of R. A. Fisher’s z and 
G. Snedecor’s F, Vi and being the degrees of freedom. 

Karl Pearson in his introduction to the Tables of the Incomplete Beta-function (O) 
says: ‘No single method has hitherto, been discovered for evaluating numerically 
the incomplete Beta-function for aU values of p and q.’ Those who have done 
numerical work on this function and its various transformations will agree that 
the main difficulty is the limitation in scope of any single method and the variety 
of methods required to deal appropriately with the range of the parameters p 
and q and of the variable x. This difiicnlty is enhanced when the task is the 
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calculation of percentage points of x rather than the tabulation of the function 
large number of numerical processes, each specially designed to deal 
with certain ranges of the new tables, had to be employed to accomplish this task. 

The choice of a suitable method is largely determined by three important 
factors: 

(а) The accuracy required for x. This was fixed at five significant figures. 

(б) Existing tables available as a starting point. These are: Karl Pearson’s 
Tables of the Incomplete Beta-function{i). Eisher’s tables of percentage points 
of z(i), and corresponding tables of its transformation F{3), arid finally Karl 
Pearson’s Tables of the Incomplete F-f unction (5). 

(c) The number and relative position of percentage levels, P, and the values of 
p and q for which the percentage points x are to be calculated. 

The importance of (a) and (f>) is obvious, but (c) is no less relevant. It will, as 
a rule, be uneconomical to produce an interpolable table of I^{p, q) merely to 
obtain a single i^eroentage point by inverse interpolation. If, however, a larger 
number of percentage levels is to be calculated the method becomes worth while. 
In this connexion it should be remarked that the original plan was to produce 
tables for P = 0-006, 0-01 ; 0-05 and O-lO only, and that it was decided at a later 
fStage to add tables for the remaining values of P. 

Summary or numerical methods employed 

(1) Inverse interpolation in Karl Pearson's tables. A large number of per- 
centage points were obtained by inverse interpolation in the tables of 

The particular tables required were differenced on a National machine (2) and 
six significant figures of x found by the method of inverse interpolation described 
by L. J. Comrie(2), taking into consideration the higher order differences. The 
method breaks down when for large p and small q (or for large q and small p) the 
tabular interval of 0-01 is too wide for the tables to be interpolable. With the 
notation adopted, the difficulty arises in the top right-hand corner of the tables 
when for large q and small p the root x is smaller than 0-06. Roughly speaking, 
roots greater than 0-05 could be obtained from Pearson’s tables. 

(2) Interpolation in tables of percentage points. It will be noted that in the 
present tables percentage points are given for certain values of p and q for which 
the function 4(p, q) has not been tabulated by Pearson. Such points occur in the 
rows 2p = 23(2)29, 120 and in the column 2q = 120. Whilst the calculation of the 
two marginal lines (2g = 120 and 2p = 120) necessitated special methods, the 
extra entries in the interior of the table were easily obtained by p-wise inter- 
polation, using suitable formulae of the Lagrangian type. 

(3) Extension of the tables of I^{p, q). The well-known recurrence' formula 

•4(p>9') = (6) 

is particularly convenient if Ix{p,q) is required for a lattice work of integer 

ii-a 
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co-ordinates 2p, 2q in a certain range and for a limited range of x. For small 
and large or moderate q all percentage points of q) are clustered near 0. It 
appeared worth, while, therefore, to use the recurrence formula (6) to extend 
Pearson’s table hy producing 

4(P,3) for 2p = 1(1)7, 2g= 1(1)30 
and » = 0'001(0-00 1)0-012, 0-015 and 0-026 

in order to obtain more of the required percentage levels x by inverse interpolation 
at intervals 0-005 and 0-001 respectively.* This made it possible to obtain a 
large number of percentage points a: in a range where Pearson’s tables are not 
interpolable. 

To start the recurrence process, the functions 


4(1.2) and 4(?>.l) 

are required for the above ranges of p and q. The first two functions were obtained 
from the expansions 


4(i2) 



2(2-1) , 2(g-l)(2-2) . 
3 1! * ■*^6 2! 


2 {q~l){q~2)iq~3) , 
7 3! 


(’) 


. , 1 1.3a;»+2 i.3.6a^'+“ ) 

xKPA) - jB(p,^)(^'l'2.l!(p-H)'*’22.2!(p-t-2)'^2».3!(p-l-3)'^’")' 


In dealing with the expansion of 4(i>3)> terms required for 10-deoimal 
accuracy in Io-o 2 B(i> 3) were first produced and then reduced for smaller values of 
X by multiplying by the appropriate power of a;/0-025. The quantities 


4(1.3) = l-^(l-a:)®. = 

were produced with the help of logarithmic tables. The remaining functions 

4(1-5. 3). 4(2-5, 3). 4(3*5. 3) and 4(2,3). 4(3,3) 

were then obtained by four recurrences covering the following combinations of 
the parameters 2p and 2q. 

Odd values of 2p and odd values of 2q 
Even „ odd „ 

Odd „ even „ 

' Even „ even „ 

Having thus dealt with the main body of the tables we now turn to the more 
difficult problem of calculating entries x near the margin of each table of per- 
centage points. In what follows, methods will differ according to whether 2p is 
odd or even. 


* [It 18 hoped that at a future date it ^ ■> 
1 *( 3 >. 2 ) a supplement to the Tables of the . 


"ih these extended values of 
Ed.] 



Catherine M. Thompson 167 


(4) Building up the, polynomial part of Ix(p,g) from a constant high-order 
difference (2p even). For integer p, 

may be expressed as a polynomial in (1 - a;). We have 

2) - B^ip, 3) = (1 - a:)/s ( - Y (10) 

i=o g+h 

or, introducing y = 1 — jb, 

By{q, P) = {-Y (11) 

1=0 q + r 



By(q, p) is therefore the product of the gth power of y and a simple polynomial in y. 
If p is small (2p:^ 12) this pol37nomial can be built up easily on the National 
machine (2). As an example, for 2q = 60 and 2p = 6 we have the equation 
14880{5(3, 30) - 30)} = 1488015^(30, 3) = 2/30(496 - 960y + 465?/0). 

The polynomial 496 — 960?/ + 466i/® was built up on the National machine from 
its constant second difference for values of y beginning at ?/ = 1 and descending 
at interval 0-001. The polynomial values were multiplied by the 30th power of 
the argument and the products checked by differencing. Finally the percentage 
points y (or x) were found by inverse interpolation. This method was used for 
2q = 40, 60, 120 and 2p = 2, 4, 6, 8, 10, 12. For larger values of p the building 
up process becomes too laborious. On the other hand, with increasing p the 
percentage points x increase in value, so that for values of 2p greater than 12 and 
not exceeding 100 results could be obtained from Pearson’s tables by inverse 
interpolation. 

(5) Taylor expansion at approximate percentage point (2p even). It remains, 
therefore, to consider the last column 2q = 120 for 2p ^ 14. For small a: (i.e. values 
of y in the neighbourhood of 1 ) the terms of the expansion (11) have to be calculated 
to a very high degree of accuracy, since these terms have alternating signs and 
many significant figures are lost when adding to produce B,^{q, p), which is very 
small. Since seven significant figures are required for By(q, p), in some cases 20 
decimals are required for the terms in (11), and their computation becomes 
laborious. A method was, therefore, evolved whereby By{q, p) has to be calculated 
for one single three-decimal argument only. Although the function By{q, p) is 
difficult to compute, its derivatives are easily calculated. It is, therefore, natural 
to use a Taylor expansion 

By+jfq, p) - B.y{q, p) = %®-i(l - y)»-i jl -t + •••}■ (12) 


With a known* three-decimal approximation y to the exact percentage point 
y -^h the main task consists in the calculation of By(q, p). This was done from 
formula (11), using tables of powers(9), and a high capacity electric calculating 

* It will be shown later how this approximation was obtained. 
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machine. The correction, h to the approximation y was then easily obtained by 
iteration from equation (12), With 'h^ — O the iteration is given by 


h 


H+l 


where 


AB^ 


[B{p,q)P-By{q,p)'\ 


(13) 


Since the numerator, AB, of equation (13) does not vary, the corrections 
and Aa are easily produced in turn, three steps being sufficient in most cases. 
Occasionally the term arising from the third derivative had to be included in the 
denominator. In this way the percentage points were calculated for 


2p = 14(2)22 2q ^ 120, 

2q = 14(2)22 2p = 120, 

the values for 2q = 14, 16, 18, 22 and 2p = 120 being required for checking by 
differencing. 

A word has to be added concerning the three-decimal approximation to the 
percentage points of x for 2q == 120 and 2p ~ 120. In some oases such values 
could be obtained from Fisher’s table of percentage points of z using the trans- 
formation p 

* == — 

p + qe^ 


In cases where such values are not available they were obtained by harmonic 
interpolation. More precisely, the finite limits 

lim 25 fa; and lim2j?(l — a;) 
q->oo p^oo 

p — constant q — constant 

werejirst obtained from the functions I {u,p) of Tables oj the Incomplete P-f unction. 
The above limits depend on p and q and-are given by 2u ,jp and 2u fq respectively, 
where u is the root of ~ 1) = P and l{u,q-l) = 1 - P respectively. The 
quantity 2^a:, being known for the arguments 1/2^ = 0/120, 2/120, 3/120, 4/120, 
6/120 and 6/120, was then obtained (to about four-decimal accuracy) for the 
argument ll2q = 1/120 by a Lagrangian interpolation formula. Similarly the 
quantity 2j9(l -k) was calculated for lj2p ~ 1/120. 

We are left, therefore, to consider the entries in the column 2q ~ 120 with 
2p odd, and in the row 2p — 120 with 2q odd, and also certain entries in the top 
right-hand corner for 2q > 30 and odd 2p < 13. 

(6) Binomial expansion with fractional index (2p odd). In the top right-hand 
corner values of x are small and it is therefore to be expected that the expansion 

B{p,q)P 

i=0 P + ^ 

is reasonably convergent for such values of x. 


( 14 ) 
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The coeflS-cients of the expansion (14) were calculated for 2q = 1(1)30, 40, 60 
and 120, and 2p = 1(1)9. The root x of the equation (14) was then found by a 
suitable iteration process. 

In some cases {2p = l,1^2g'<30) it was found convenient to invert the ■ 
expansion (14). With 2p ~ 1 the equation (14), if regarded as an expansion in 
powers of >^Jx, may be reversed to yield ^Jx as an expansion in powers ofB{p,q)'x P. 
Because of the particular importance of the case 2^ = 1 (the i-distribution) it is 
of interest to give here example, s of formulae from which any percentage point x 
may be obtained directly by substituting the corresponding percentage level P. 
If D — ^B{p,q)P =: lB^{p,q), the first five terms of the reversed expansion 
are as follows; 

^X = D + ^ ^ ^ w - - -0° 

o ou 

, (g-l)(127g^-131g+34) 

630 

(g - 1 ) (4369?3 - 62855^2 q. 3042^ - 496) „„ 

+ 22gg0 +..., 

from which the expansion for any particular q may be worked out without 
difficulty. Thus for q = 10, 

V* = D+ 3DH 19-81)5 + i63-2i)’+ 1496-22)9+ ..., 

and for q = 25, 

^x = i) + 8I)5 + i36-82)5 + 2900-32)’ + 68162-0I)9 + .... 

(7) 'Numerical integration. For large p and q, when the integral 
approaches the normal probability integral, a variety of methods has been 
developed(6,7,9). With mechanical computing aids available, numerical integra- 
tion appeared to be the simplest. The integrand xp-^{1 — represents a smooth 
curve and was produced at interval 0-01 with the help of logarithmic tables and 
checked by differencing on the National machine. Numerical integration was 
performed by Gauss’ formula and the integral B^{p,q) checked by differencing. 
Finally, x was obtained by inverse interpolation and checked by the application 
of Taylor’s expansion at the tabular value nearest to x. This method was used for 
2q = 120 and,2p = 24, 26, 28, 30, 40, 60 and 120 and also for 2p = 120 and 
2q = 24, 30, 40 and 60. 

For 2q = 120 and 2p = 7(2)29 all percentage points were obtained by p-wiae 
interpolation between the entries 2p = 2(2)30, 40 and 60, using appropriate 
formulse of the Lagrangian type. 

(8) Approximation by the incomplete P-function. It remains to consider the 

entries for 2p = 120 and 2q — 1(2)9. 

There appears to be a lack of suitable methods for obt^^ining accurate results in 
this range of q for isolated large values of p. Three-decimal approximations Xo to 
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the percentage levels may be obtained by harmonic interpolation, as described 
in §(5).* To obtain the correct percentage points {x = Xq-^%), the main task 
consists in calculating -4^(60, s') to six places of decimals. This was done with the 
help of a recently developed approximate formula giving q) in terms of the 
incomplete T-function-f This formula is akin to a Taylor expansion of I^{p, q) 
in powers of at l/2p = 0 (2y> = oo) and may be written as follows'. 

1 --4b. 2) S I{n, q~\) “ ‘1 ’ 


where 


u 


and 

x^Jq X 


and the terms Tj are dependent on A and q only, the first two being 


Ti = q~l-\ 

n=42®--|2H|g-i + ('|sH%-i)A+(|gr-i)A2-iA3. 

This formula is very accurate for large p and small or moderate q. The terms 
and Jo were calculated in each case for 


where Xq denotes a suitable three-decimal approximation to the true percentage 
point. To obtain we make use of the fact that equation (15) should yield 
4(60, q) to seven-decimal accuracy. Since 4.(50, q) is obtained to that accuracy 
from Pearson’s table and since T^, and Tj depend on A and q only, we may use 

7) 

equation (15) to determine Jg by substituting p = 50, A f= Ap, x = x^ = ~ — , 

Afl+jP 

qi - JW . _ 'With Tq, Jj and % computed, 4 (60, q) is easily obtained from 

equation (15). Finally the exact percentage point Xq + A is calculated by the 
iteration process (13). 

Checks 


The main body of each table of percentage points (i.e. the interior of each table) 
was checked by differencing p-wise and g-wise at interval For large q and 
moderate p, x may be considered as a function of l/2g and differenced at interval 
1/120, i.e. for l/2g = 0/120 (1/120) 6/120. Similarly a; maybe differenced for large 
pandmoderatevaluesofgfor l/2p == 0/120(1/120) 6/120. Four significant figures 
may be checked in this way, thus eluninatiug any possibility of serious errors.. 
For smallp and large q, the quantity x is almost linear in 1 /2g so that a good check 
was given by examination of the product 2qx, which is almost constant. Never- 

* In thia case it was sufficient to use a Lagrangian formula for interpolation between values of x 
(with a: = 1 for 2p = «>), instead of performing the more complicated interpolation between values 
of 2^){1— »). 

t Ihe derivation of this formula ia given in a paper by H. 0. Hartley wliioh. will, it is hoped, 
be published in the next issue of Biometrilca. 
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theless, the only available check to guarantee five-decimal accuracy at the margins 
was reeomputation. As far as possible repetition of the method employed in the 
first instance was avoided. Thus inverse interpolation was replaced by direct 
interpolation or by a Taylor expansion at a tabular value; iteration processes 
were varied in the formulae employed. 


METHODS OE INTERPOLATION 
By H. 0. HARTLEY 
iNTRODirOTION 

In SO far as the table is required in coimexion with standard tests of significance 
the user will be concerned with obtaining x for any percentage level P and for 
integer values of and ~ 2q. 

The values of P and the row and column headings (2p, 2q) have been selected 
in such a way that the user will generally find the required value of x tabulated. 
Moreover, for most of the applications it. suffices to estimate roughly the magni- 
tude of interpolates from an inspection of the table. In some cases, however, 
interpolation to about five^decimal accuracy is necessary. The problem of inter- 
polating between oorresponding entries in different tables of percentage points 
(interpolation P-wise) will, it is hoped, be dealt with elsewhere and we are here 
only concerned with interpolation in each individual table of percentage points 
x{2p, 2q) to find x for any combination of integer arguments 2p, 2q* 

In the present tables interpolation to integer arguments = 223, Vj = 2q 
will occur in three different forms: 

(1) Single-entry interpolation g'-wise in the range 1 ^ 2^ ^ 30 and 10 ^ 2g < co. 

(2) Single-entry interpolation p-wise in the range 1 ^ 2g ^ 10 and 30 < 2p < oo. 

(3) Double-entry interpolation for 30 ^ 2p, 10 ^ 2g. 

If both 2p and 2g are large, interpolation in the tables is impractical, and it 
Was therefore necessary to add a fourth section, namely: 

(4) Approximate calculation of percentage points if both p andg are large. 

It will be noted that, following the lay-out adopted in other tables of per- 
centage points (3, 4), the column headings = 2g = 20, 24,- 30, 40, 60, 120 
and 00 are in harmonic progression. If, therefore, l/2g is used as a variable, 
these columns form a tabulation of x at equidistant intervals of the variable 
l/2g. The same harmonic progression is given for the row headings = 2p, 
although here the tabulation at unit interval goes up to 2p ~ 30, because of 

* Fractional values of 2$ and occur in a number of applications when the percentage points 
of certain Pearson-type curves are required. In such oases it will be found most convenient to 
apply single-entry interpolation formulic, first in one direction (p or q) and then in the other. 
Methods akin to those given here cover the range 25 ' >10, 2?) >30. For the range 0^22X30, 
0<2g^l0, successive single-entry interpolation (jj-wise or g-wise) at unit interval should afford 
no difficulty provided the arguments of the interpolate do not lie within the strips 0<2p<3, 
0 < 2g < 2. Within these strips interpolation cannot be carried out without the aid of auxiliary tables. 



162 Petceutage points of incomploto hetci-f unction 

the importance of these values for certain tests of significance. The use of the 
variables l/2g and l/2p greatly facilitates interpolation .but even with this 
device (known as harmonic interpolation) high-order interpolation formulee 
have to be used in many c§,ses, if the accuracy of the tabular values is required. 

To facilitate single-entry interpolation, therefore, an auxiliary table of Lagran- 
gian coefficients has been prepared. Although this auxiliary table has been 
specifically designed to meet the requirements of the present tables of percentage 
points of X, it is given and described in a separate paper (p. 183) since it is felt 
that it will have a wider application to any table of percentage points with a 
similar lay-out. 


1. SmaM-ENlRY mTEEPOLATION g-WISE 
No interpolation is required for the range 1 10(K 10). Tor 

Vi = 2q> 10 interpolates are obtained with the help of the auxiliary table on 
pp. 183-6 of tliis issue, and its use is best explained in terms of an examifie. 
Example 1. Tind the 6 % point corresponding to 2p = 26, 2^ = 74. 

In the auxiliary table (p. 186 below) enter the row headed 74, that is, the row 
whose heading is equal to the value of 2q for which the interpolate is required. 
The entries in this row are the (Lagrangian) multipliers in a sum of products 
which yields the interpolate x. The corresponding multiplicands are taken from 
the table of 6 % points x (2p, 2q), We enter the row headed 2p = 26 and select 
entries a:(26, 2q) for 2q — 20, 24, 30, 40, 60, 120 and co. These correspond to the 
column headings in the auxiliary table. The sign of each product is also given at 
the top of the columns. We have, therefore, 

x(26,74) = 0-395 16x0-006 867-0-357 56x0-046 623 -t- 0-313 14x0-162 013 

-0-259 66x0-372 737-1- 0-193 79x 1-018 370-f 0-110 24 x 0-247 951 
= 0-164 64. 

This may be compared with the exact value 0- 1 64 637 obtained by inverse 
interpolation from Pearson’s tables. 

The accuracy of the interpolates depends on 2p and 2q and (to a lesser extent) 
on the percentage level P. In favourable cases, if 2p is small and 2q moderate, 
the interpolate is accurate to 6 places of decimals. In the least favourable oases, 
for 2p near 30 or 2q large, the fifth decimal of the interpolate may be in error. One 
more example is given to demonstrate the use of the auxiliary table. 

Example 2. Tind the 50 % point for 2p = 11, 2q ~ 17, 

Entering the row 17 in the auxiliary table and the row 2p = 11 in the table 
of 50 % points we have 

*(11, 17) = ■+• 0-525 38 X 0-003 097-0-476 96 x 0-037 459 + 0-419 02 x 0-409 711 
+ 0-348 45x1-213 968-0-307 07x0-866 212 + 0-260 64x0-315 162 
-0-208 18x0-048 267 
= 0-387 62. 
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It will be noted that for 2q = 16, 17, 18 and 19 two alternative rows are given in 
the auxiliary table, one (which has been used in the above example) is under the 
heading ‘Harmonic’ Lagrangian coefficients. The other row contains ‘Ordinary’ 
Lagrangian coefficients. It is in this range of that there is little to choose 
between the merits of ordinary and harmonic interpolation, and the use of both 
methods provides a good check. Reworking the above example and using ordinar;y 
Lagrangian coefficients we have 

33(11, 17) = -0-663 46 X 0-306 397 + 0-525 38 x 0-780 000-0-476 96 x 0-983 025 
+ 0-419 02x1-258 272 + 0-348 45x0-289 546-0-307 07x0-040 124 
+ 0-260 64x0-001 728 
= 0-387 62. 

There is satisfactory agreement between the two interpolates and the. exact 
value (obtained from Pearson’s table) which is 0-387 619. 


2. SlNGLE-miBY INTERPOLATION ^-WISL 

No interpolation is required for 1 < 2 p < 30 (1 < Vg ^ 30). For = 2p> 30 we 
again use the auxiliary table onpp. 184, 185 of this issue. This time, however, the 
argument 2p of the interpolate determines the row to be entered jn the auxiliary 
table, whilst column headings of this table are made to correspond to selected 
rows in the table of percentage points. The method is best explained by the 
following examples. 

Example 3. Find the 0.-5 % point corresponding to 2 g' = 4 and 2p = 96. In 
the auxiliary table enter the row headed 96. The entries in this row are the 
Lagrangian multipliers. The corresponding multiplicands are taken from the 
table of 0-5 % points. We enter the column headed 25 = 4 and select entries 
x{2p, 4) for 2p = 20, 24, 30, 40, 60, 120 and co which correspond to the column 
heading in the auxiliary table. 

The sign of each product is also given at the top of the columns. We therefore 
have 

33(96, 4) = + 0-491 44 X 0-005 875 - 0-550 98 x 0-044 647 + 0-618 64 x 0-152 207 
-0-696 71x0-318 909 + 0-783 70x0-658 090 + 0-884 42x0-669 708 
-1-000 00x0-022 324 
= 0-857 94, 

which differs by 2 unilfe in the fifth decimal from the exact value (0-867 92) 
obtained by inverse interpolation from Pearson’s tables. 
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Mmmpk, 4. Find the 5 % point corresponding to 2p — 80 and 2g' = 30. We 
have 

a:(80, 30) = +0-246 39x0-006 836-0-292 08x0-062 734 + 0-362 00x0-184 570 
-0-433 21x0-410 156 + 0-548 07 x 0-922 851 + 0-720 16 X 0-369 141 
-1-000 00 x 0-020 508 
= 0-624 69. 

The exact value is 0-624 76. 

Again, the accuracy of the interpolates depends on 2g, 2p and to a lesser 
extent on P. Five-decimal accuracy is obtained for small 2q and moderate 2p, 
whilst only 4 decimals are reliable if 2g is near 30 or 2^ is large. 

3. Harmonic notiBLB-BNTBY intebpolation 
In this section we deal with interpolation in the range 
30^2j3<OO, 10;^ 22 <00, 

provided the arguments of the interpolate are not ‘ too large ’ . The exact meaning 
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of this restriction is that if the methods described below are used to find inter- 
polates in the range 

60 ^ 2^)< oo, 404 2g< oo, 

the results obtained are unsatisfactory. In this range the user should, therefore, 
proceed on lines described in section 4. 

The method is essentially double-entry interpolation between points of the 
lattice work shown in Pig. 1. 

The 5, 2|, 1 and ^ % values (i.e. the quantities a: for P = 0-05, 0-026, 0-01 and 
0-005) are practically linear to three-figure accuracy in the diagonal direction 
indicated by the broken lines in Fig. 1, 
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To explain the method it will be convenient to regard the percentage points 

X as functions of ^20 60 

V ~ 'IT' and £ == — , 

2p ^ 2q’ 

and to introduce the notation 

The relation between the argument p, i and 2p, 2q is demonstrated in Fig. 1. 
To obtain the interpolate x for any p, q in the above range calculate 


and find 


. 60 , 120 
4=^ and J/ = -r- 
2q ' 2p 

S = integral part of £, 

If = integral part of 7j, 

H = (^~S) + (9-7/)-1. 



Pig. 2. 


If (i is positive consider the parallelogram with vertices at [77, ■5'+ 1], [77+ 1,5], 
[77+1,5'+ 1] and [77+2,5] (see Fig. 2). Now calculate two interpolates x^ and 
% at points and P* from the (approximate) formulae 

= /ta;[77+l,5+I] + (l-/t)a;[77,5+l]| 

Xi^ fix[H+2,S] + (\~(i)x[H+l,E\ ) 

and finally find the interpolate x\r), 

+ (17) 

If p is negative the points Pj and will be at distance fi below the points [77, 5 + 1 ] 
and [77+ 1, 5] respectively, and we have the formulae 

*2 - ~/<aj[77,S] + (l+/t)a:[77+ 1,5] 
in place of equations (16). 
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Example 5. Find the 1 % point a; for 2p = 42, 2? =' 16. 

We have 

g=3-76, = 2-8571, S = 3, /f=2, /t = 0-6071. 

To apply formnlse (16) the tabular values are taken from the table of 1 % points, 
where we find to 4'decimal accuracy 

a;[3, 4] == a:(40, 15) = 0-5140, *[4, 3] = x{30, 20) = 0-3706, 
x[2, 4] == *(60, 15) = 0-6297, *[3, 3] = *(40, 20) = 0-4578, 

Applying the equations (16) we obtain 

*1 = 0-6071x0-5140 + 0-3929x0-6297 = 0-5695 
*2 = 0-6071x0-3706 + 0-3929x0-4678 = 0-4048, 

Finally we calculate 

*[2-857, 3-75] = *(42, 16) = 0-76 x 0-6695 + 0-25 x 0-4048 = 0-5208. 

The exact value obtained from Pearson’s table is 0-5] 63, If higher accuracy is 
required we have to improve the approximate relations (16) by adding the 
second-order difference effect. If this is done we obtain 

*1 = 0-5565, *2 = 0-4016 and * = 0-5170, 

which agrees satisfactorily with the exact value. If this method is applied to the 
tables of 10, 26 and 50 % points and if a similar precision is required, the second- 
order difference effect should also be considered when interpolating along the 
diagonals. In such cases the right-hand side of equation (17) should have four 
terms. 

4. CALOULATIOU' OS’ PERCENTAGE POINTS IE BOTH p AND q ARE LARGE 

If * is required for values of 2p and 2gf in the range 2p > 60, 2y > 40, inter- 
polation between the tabular values is not possible because of the singularity of 
* at 2p = 00, 2q = CO . In this range therefore * has to he calculated ab initio. 
Certain approximate foimulaa fox the incomplete beta-function axe valid in 
this range (8, 10 ). These formulae, whilst useful for a calculation of P = q) 
as functions of *, 2p and 2g, cannot be easily inverted to yield * for the given 
percentage levels P. 


Auxiliary table 

y = normal deviate at level ^ = -^( 1 /“ + 3) 


p 

0-60 

0-25 

0-10 

0-06 

0-026 

0-01 

0-005 

y 

0-0000 

0-6746 

1-2810 

1-6449 

1-9600 

2-8268 

2-6768 

A 

0-6000 

0-6768 

Q'lni 

0-9.509 

1-1402 

1-4020 

1-6068 

A -4 

0-3333 

0-4092 

0-6071 

0-7843 

0-9736 

1-2363 

1-4392 


A more convenient approximation of sufficient accuracy has recently been 
given by Ooohran(i), who extended a method suggested by Fisherii), It is 
essentially an approximation (by the normal distribution) to Fisher’s 2 -trans- 
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formation of x and it involves values of the normal deviate y at the appropriate 
percentage levels P. These nornaal deviates y together with a function of y 
denoted by A are tabulated above for the levels P with which we are concerned 
here. 

To find an approximation to x for any pair of arguments 2p, 2q in the above 
range, calculate in turn the quantities 


2p + 2? 

z = y (A-i)(^-2j)) 
^j{A — X) pA 


X = 

* 2pi-2qe^' 

As examples we consider two tabular values of a: .in order to obtain some idea of 
the accuracy of the approximation. 

Example 6. P s= 0*01, 2p = 120, 2q = 40, 


A = 


8x60x20 _ 2-3263 l-236(-60) 

160 “ ’ *~V58'598'^ 60x60 


0-2833, X = 0-6299. 


This agrees with the exact value to four decimals. 
Example 7. P = 0-60, 2p = 30, 2q = 120, 


A = 


8 X 16x60 
160 


= 48, 


0-333x18 

16x48 


0-00833, X = 0-1974. 


This differs from the exact value by about a unit in the fourth decimal. 
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Bbta Distribotion: 60 per cent Points job a? 


vj=2? Vi~ip 


V 

1 

2 ' 

3 

4 

6 

0 

7 

8 

9 

1 

0-50000 

0'26000 

0-16319 

0-12061 

0-096626 

0-079033 

0'067378 

0 - 0587 I 1 

0'062016 

2 

■75000 

■50000 

•37004 . 

■29289 

•24214 

•20630 

■17966 

•16910 

•14270 

3 

■83681 

■02996 

■60000 

■41363 

•36246 

•30695 

•27181 

•24386 

•22112 

4 

■87939 

■70711 

■58637 

■60000 

■43666 

•88673 

■34609 

■31381 

■28703 

5 

0-90447 

0‘76786 

0-64786 

0-66444 

0-60000 

0'44867 

0-40684 

0-37213 

0-34286 

6 

■92097 

■79370 

■69305 

■61427 

■65133 

•50000 

•45737 

•42141 

■39068 

7 

■93262 

•82034 

■72819 

•66391 

•69316 

•54263 

•60000 

•46365 

•43206 

8 

■94129 

•84090 

■76614 

•68619 

•62787 

■67859 

•63646 

■50000 

•46818 

9 

•94799 

•85724 

•77888 

■71297 . 

•65714 

■60932 

•60796 

•53182 

•60000 

10 

0-95331 

0^87066 

0’79775 

0'73555 

0-68214 

fl '63588 

0-69546 

0-55984 

0-52824 

11 

■96766 

•88169 

•81366 

•76484 

•70378 

•86907 

■61968 

•58471 

■65346 

12 

•96125 

•89090 

•82725 

•77161 

•72262 

•67948 

•64116 

•60692 

■67613 

13 

■96429 

•89886 

•83899 

•78608 

■73923 

•69759 

•66036 

•62687 

‘69661 

14 

■96689 

•90672 

•84924 

■79887 

■76396 

•71376 

■67760 

•64490 

•01620 

16 

0'96913 

0^91172 

0-86827 

0-81023 

0-76712 

0-72830 

0-69318 

O -60127 

0'63216 

16 

•97109 

•91700 

•86627 

•82038 

•77894 

•74143 

•70732 

•67620 

•64768 

17 

■97282 

•92169 

•87342 

•82960 

•78963 

•76334 

•72022 

•68986 

•66195 

18 

•97436 

•92687 

•87986 

•83774 

•79932 

■76421 

•73203 

•70242 

•67611 

19 

•97672 

•92964 

•88665 

•84622 

■80817 

•77417 

•74288 

■71401 

•88728 

20 

0-97696 

0'93303 

0-89092 

0-86204 

0*81626 

0-78331 

0-76289 

0-72472 

0'69868 

21 

•97806 

•93612 

•89673 

■86828 

■82370 

■79176 

•76216 

•73467 

•70909 

22 

•97907 

■93893 

•90013 

•86402 

•83067 

■79966 

■77074 

■74392 

•71889 

23 

■97999 

■94151 

•90417 

•86931 

•83692 

■80679 

•77873 

•75254 

■72806 

24 

■98083 

■94387 

■90790 

•87421 

•84281 

•81363 

■78618 

•76061 

■73663 

25 

0'98161 

0-94606 

0'91136 

0-87876 

0-84828 

0-81981 

0-76316 

0-76816 

O -74460 

28 

•98232 

■94808 

■91465 

■88298 

■86340 

•82668 

■79968 

•77626 

•76227 

27 

■98298 

■94996 

•91763 

■88692 

•85817 

•83118 

•80681 

■78193 

•76941 

28 

•98360 

■96170 

•92031 

■89060 

■86266 

•83636 

■81167 

•78821 

•76616 

29 

■98417 

■96332 

■92290 

■89408 

■86686 

•84120 

■81701 

•79416 

•77263 

30 

0-98470 

0-96484 

0’92534 

0'89730 

0-87080 

0-84678 

0-82214 

0'79976 

0*77866 

40 

•98856 

•96694 

•94324 

•92136 

•90038 

■88030 

■80107 

•84266 

•82601 

60 

•99238 

•97716 

•96164 

■94646 

■93168 

■91731 

■90338 

■88986 

•87672 

120 

■99620 

•98861 

•98066 

■97264 

•96482 

•95710 

•94961 

•94202 

•93466 

CO 

I'OOOOO 

hOOOOO 

1-00000 

1-00000 

1-00000 

i 

1 '00000 

1-00000 

1-00000 

1-00000 


This table gives the values of x for which (j), 9i)=0’50 where = g=ivj. 
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Beta Distbibotion": 50 pm oent Points pob so 
>> 8 = 2 ^ 




1 O’ 046687 

2 '12946 

3 '20325 

4 '26446 


0-038746 0-030867 0-023062 0-019168 0-015301 0-011460 ' 0-0076166 0-0037997 
•10910 -088278 -066967 -066126 -046168 -034064 , -022840 -011486 

-17276 -UllB -10908 -092099 -074664 -066766 

•22849 -18977 *14796 •12679 '10270 -078644 



Biometrika xxxii 


12 



















170 Percentage points of the incomplete beta-function 


Beta DisraiBtrTioiir: 26 pee cent Points foe x 


h-H 


V 

H \ 

1 

2 

3 

4 

6 

6 

7 

8 

9 

1 

0'14646 

0-062600 

0-039063 

0-028309 

0-022173 

0-018216 

0'016463 

0-013416 

0-011863 

2 

•43760 

■25000 

•17452 

•13397 

•10870 

•091440 

•078908 

•009395, 

•061929 

3 

■69716 

•39686 

•29801 

•23886 

•19937 

■17113 

■14991 

■13339 

•12015 

4 

•68878 

•60000 

•39448 

•32636 

■27862 

•24302 

•21560 

•19376 

■17696 

6 

0'74711 

0-67436 

0-46936 

0-39776 

0-34646 

0-30660 

0-27390 

0-24828 

0-22707 

6 

•78726 

•62996 

•52848 

■46632 

•40198 

•36944 

•32616 

•29692 

•27323 

7 

•81660 

•67296 

•57609 

•50494 

•46001 

•40614 

•37021 

■34022 

•31478 

8 

•83872 

■70711 

•61616 

•64682 

•49117 

•44680 

■40996 

•37886 

•36219 

9 

•86616 

•73487 

•64773 

•68060 

•62678 

•48246 

•44521 

•41343 

•38697 

10 

0^87021 

0-76788 

0-67629 

0-61062 

0-66783 

0-51390 

0-47662 

0-44461 

0-41666 

11 

<88177 

•77720 

•69888 

•63661 

■68613 

•64184 

■60476 

•47257 

•44435 

12 

•89144 

•79370 

•71931 

•66929 

•60930 

■66679 

•63009 

•49801 

•46970 

13 

■8996j8 

•80793. 

■73716 

•67941 

•63085 

•68921 

•66300 

•62116 

■49289 

14 

•90672 

•82034 

•75288 

•69730 

•66017 

•60946 

•67382 

•64230 

• 

•61419 

16 

0^91286 

0-83124 

0-76684 

0-71332 

0-66768 

0-62782 

0-69282 

0-66169 

0-63380 

16 

•91823 

•84090 

•77932 

•72773 

■68338 

•64466 

■61021 

•67963 

•66192 

17 

•92298 

•84961 

■79053 

•74077 

•69772 

■66986 

•62619 

■69699 

•66870 

18 

•92721- 

■86724 

•80066 

•78263 

•71084 

•67391 

•64093 

■61122 

•68428 

19 

•93100 

•86422 

•80986 

•76346 

' -72287 

■68686 

•66460 

•62636 

•69879 

20 

0^934'42 

0-87066 

0-81826 

0-77337 

0-73396 

0-69882 

0-66720 

0-63862 

0-61234 

21 

■93761 

•87632 

•82693 

•78260 

•74418 

•70991 

•67896 

•65079 

•62600 

22 

•94033 

•88169 

•83299 

•79092 

•76366 

•72021 ■ 

•68991 

■66226 

•63688 

23 

•94290 

•88644 

■83950 

•79871 

•76246 

•72981 

‘70016 

•67300 

■04803 

24 

•94526 

•89090 

•84663 

•80596 

•77066 

■73878 

•70973 

•68309 

•66862 

26 

0^94744 

0'89503 

0-86112 

0'81268 

0-7,7831 

0-74717 

0-71873 

0-69268 

0-66840 

26 

•94944 

•89886 

•86632 

•81896 

•78647 

•76606 

■72719 

■70161 

■67774 

27 

•96130 

■90241 

•86116 

•82484 

•79218 

•76244 

•73616 

•70996 

■68667 

28 

•96303 

•90672 

•86670 

•83035 

•79848 

•76941 

■74267 

•71793 

■69492 

29 

•96464 

•90882 

•86994 

■83662 

•80442 

■77698 

•74977 

•72648 

■70286 

30 

0-96614 

0-91172 

0-87393 

0-84039 

0*81002 

0-78219 

0-76649 

0-73263 

0-71038 

40. 

•96706 

•93303 

•90361 

•87686 

•85230 

•82947 

•80809 

■78797 

•76890 

60 

■97801 

■96484 

•93434 

■91648 

•89782 

•88113 

•86626 

•86009 

■83657 

120 

•98899 

•97716 

•96648 

•96647 

•94892 

• •93774 

•92887 

•92026 

•91187 

CO 

1-00000 

1-00000 

1-00000 

l-OOOOO 

1-00000 

l-OOOOO 

l-OOOOO 

1-OQOOO 

1-00000 


This table gives the values of a; for which (jj, g)=0'26 where g= ivi- 
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Beta Bistbibution; 25 per cent Points for x 


Vx=2q v,=2^) 


yi 

''a\ 

10 

12 

16 

20 

24 

30 

40 

00 

120 

I 

O-O1O610 

0-0087814 

0'0069734 

0-0061914 

0-0043101 

0'0034363 

0-0025669 

0’0017049 

0'0’84926 

2 

•066913 

•048818 

•037631 

•028368 

•023689 

•018990 

•014281 

•0095436 

•0047832 

3 

•10930 

•092692 

•076324 

■067467 

•048307 

•038986 

■029600 

•019844 

■010012 

. 4 

•16116 

•13707 

•11360 

•087610 

•074096 

•060174 

■046827 

•031031 

•016764 

5 

0-20922 

0-18082 

0-16026 

0-11726 

0-099749 

0'081498 

0-062468 

0'042571 

0-021774 

6 

•25307 

■22068 

•18600 

•14586 

•12476 

• 10251- 

•079043 

■054222 

•027923 

7 ■ 

•29291 

•26724 

■21768 

•17316 

•14888 

•12301 

•096406 

•066867 

•034143 

8 

•32908 

•29099 

•24802 

•19913 

•17203 

•14289 

•11146 

•077403 

■040396 

9 

■36198 

•32206 

•27644 

•22376 

•19420 

•16211 

•12712 

•088814 

•040666 

10 

0'39196 

0- 36068 

0'30297 

0-24710 

0-21638 

0-18064 

0'14240 

O' 10006 

0062904 

11 

•41938 

•37712 

•32776 

•26921 

•23662 

■19860 

•16726 

•11113 

•069127 

12 

•44461 

•40168 

■36094 

•29017 

•25493 

■21670 

'171,71 

■12200 

•066317 

13 

•40782 

■42426 

•37266 

•31004 

•27338 

•23226 

•18676 

•13267 

•071466 

14 

•48893 

•44634 

•39302 

•32889 

•29100 

•24819 

•19938 

•14314 

•077670 

15 

0'60863 

0-46497 

0'41215 

0-34879 

0-30786 

0-28353 

0-21281 

0'16341 

0'083624 

16 

•62691 

•48330 

•43016 

•36380 

•32396 

•27831 

•22646 

•16347 

■089626 

17 

•64389 

•60043 

•44712 

•37998 

•33936 

•29264 

■23793 

■17332 

•096671 

18 

■66972 

•61649 

•46312 

■39639 

•36411 

•30626 

•26004 

•18298 

•10146 

19 

•67449 

•63168 

•47826 

■41008 

•36824 

•31946 

■26179 

•19244 

■10729 

20 

0^68832 

0-64574 

0-49266 

0-42409 

0-38179 

0-33221 

0'27321 

0-20170 

0’11307 

21 

■60129 

•66909 

•60613 

•43746 

•39478 

■34460 

•28430 

•21078 

•11878 

22 

■81348 

•67169 

•61900 

•46026 

•40726 

•36637 

■29607 

■21966 

•12443 

23 

•62496 

•68360 

•53122 

•46247 

•41926 

•36783 

•30664 

■22837 

'13003 

24 

•03676 

•69487 

•64286 

•47418 

•43078 

•37891 

•31672 

•23690 

•13866 

26 

0’64697 

0-60656 

0-66392 

0-48639 

0-44186 

0-38961 

0'32661 

0-24625 

0- 14103 

26 

•66563 

■61670 

•60447 

•49616 

•46283 

•39996 

•33524 

■25343 

•14645 

27 

•68478 

•62633 

•67466 

•60647 

•46281 

•40997 

•34460 

•26146 

•16180 

28 

•07346 

•03460 

•68417 

•51639 

•47272 

•41967 

■36371 

•26931 

■16710 

29 

■68170 

•64323 

•69337 

•62691 

•48228 

•42906 

•36269 

-27701 

•16234 

30 

O' 68964 

O'06168 

0'60217 

0'63808 

0-49160 

0-43816 

0-37122 

0-28453 

0'ie752 

40 

=76095 

•71768 

•67308 

•61064 

•66863 

•61556 

■44660 

•86246 

•21626 

60 

■82163 

•79629 

•78915 

•70620 

•66914 

•62066 

•66390 

■46630 

•29913 

120 

•90370 

•88794 

■86664 

■83103 

•80657 

■77041 

•71862 

•63381 

•46918 

CO 

I'OOOOO 

I'OOOOO 

I'OOOOO 

I'OOOOO 

1-00000 

1-00000 

1-00000 

1-00000 

I'OOOOO 


For ^1= 00, a :=0 


13-2 
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Percentage points of the inc&mplete heta-functitin 


Beta Distbosotion: 10 peb cent Portts fob x 

vj =25 vi=ip 



1 0-024472 O'OIOOOO O-OO01812 0-0044577 0-0034818 0-0028S53 

2 -19000 -10000 -067830 -061317 -041268 -034611 

3 -36130 -21644 -16648 -12310 -10154 -086434 

4 -46812 -31623 -24136 -19680 -16493 -14266 


6 0-66185 

0 -61375 

7 -66104 

8 -69821 

9 -72814 

10 0-76273 

11 -77328 

12 -79069 

13 -80604 

14 -81861 

16 0-82996 

16 -83998 

17 -84889 

18 -86686 

19 -86403 

20 0-87062 

21 ,-87643 

22 -88181 

23 ,-88876 

24 -89129 

26 0-89649 

26 -89937 

27 -90297 

28 -90633 

20 -90946 

30 0-91239 
40 -93381 

60 -96665 
120 -97761 

CO 1-00000 


0-39811 

•46416 

•61796 

•66234 

-69948 

0-63096 

■66793 

•68129 

■70170 

•71969 

0-73564 

•74989 

•76270 

•77426 

•78476 

0-79433 

•80309 

•81113 

•81856 

•82640 

0-83176 

■83768 

•84319 

•84834 

•86317 

0- 85770 
■89126 
•92012 
■96236 

1 - 00000 


0-31529 

•37816 

•43151 

•47700 

•61610 

0-64996 

■67964 

•60866 

•62860 

•04916 

0-66758 

•68419 

•09923 

■71293 

-72644 

0-73091 

•74747 

•76722 

•76626 

•77464 

0-78245 

■78973 

•79668 

•80294 

■80894 

0- 81469 
•86693 
•90182 
•94944 

1 - 00000 


0-26204 

■32046 

•37151 

•41611 

•45622 

0-48968 

■62022, 

•64744 

•67181 

■69376 

0-61380 

•63164 

•64809 

•66316 

•67699 


0-22467 

•27868 

•32686 

•36982 

•40811 

0-44232 

•47300 

•80062 

•62660 

•54827 

0-66893 

•68783 

•60617 

•62114 

•63688 


0-19664 

•24664 

•29210 

•33319 

•37029 

0-40382 

•43419 

•46178 

•48693 

•60992 

0-63100 

■68040 

•66829 

•58484 

•60020 


986 0-0018628 
90 -023141 
47 -059809 
4 -10147 


0-68978 0-64964 0-61448 

•70168 -66222 -62779 

•71260 -67403 -64022 

■72268 -68604 -66187 

■73216 -69536 -66279 

0-74103 0-70600 0-67306 0-64434 

■74933 • -71407 -68271 ■ 

•76711 -72260 -69183 

■76443 -73064 -70044 

■77132 -73823 -70868 

0- 77783 0-74641 0-71630 0-68984 l0-66i 

■82706 -80026 -77678 

•88023 -86048 -84212 

•93773 -92679 (91643 

1 - 00000 1-00000 1-00000 1-00000 



This table gives the values of x for -which {p, «)=0-10 where p-ivi, ?=iv,, 
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Beta Disteibution: 10 pee obm Points poe x 


vi=2g i'j=2p 


B 

B 




24 

30 

40 

60 

120 

1 

0'0016686 

0'0013709 

0^0010878 

0-0380919. 

0-0»67167 

0-0®53806 

0-0*39965 

0'0*'26536 

0-0*13213 

2 

‘020862 

•017407 

■013960 

•010481 

•0087416 

•0069994 

•0052642 

•0036069 

•0017646 

3 

•064246 

■046740 

•037035 

•028119 

•023679 

•018982 

■014327 

■0090132 

•0048379, 

4 

■092596 

■078823 

•064482 

•049462 

■041691 

■033749 

■026617 

•017288 

•0087621 

6 

0‘13167 

0'11307 

0^093336 

0-072324 

0061295 

0-049889 

0-038083 

0-026861 

0-01S167 

6 

■16964 

•14688 

•12228 

■098663 

•081477 

•066668 

■081174 

■034941 

•017906 

7 

■20673 

•17941 

•15069 

•11886 

•10173 

•083668 

■064573 

•044346 

•022866 

8 

■23966 

•21040 

■17792 

•14161 

•12177 

•10064 

■078083 

■063928 

■027978 

9 

■27139 

■23970 

•20411 

•16374 

•14141 

■11743 

■091677 

•063600 

•033196 

10 

0'30097 

0^26732 

0'22908 

0-18513 

0-16066 

0- 13394 

0-10497 

0-073298 

0-038489 

11 

■32863 

•29330 

•26284 

■20876 

•17915 

•16010 

•11820 

•08?977 

•048832 

12 

•36422 

•31772 

•27540 

•22669 

•19716 

•16687 

■13123 

■092004 

■049206 

13 

•37817 

•34068 

•29682 

•24464 

•21467 

•18124 

■14403 

■10215 

•054597 

14 

•40063 

•36228 

■31716 

•20292 

•23139 

•19619 

•16669 

•11161 

•069993 

15 

0^42143 

0^38261 

0'33646 

0-28046 

0*24762 

0-21072 

0-16889 

0-12096 

0-065386 

16 

•44100 

•40176 

•35478 

•29726 

•26327 

•22483 

■18093 

•13019 

•070768 

17 

•45934 

•41983 

•37219 

•31338 

•27837 

•23863 

•19270 

•13030 

■076134 

18 

•47687 

•43889 

■38876 

•32886 

•29293 

•25182 

•20420 

•14828 

•081478 

19 

•49277 

•45302 

■40451 

•34369 

•30697 

•26471 

•21644 

•16712 

•080790 

20 

0^50803 

0^48829 

0’41962 

0-35793 

0-32051 

0-27721 

0-22642 

0-16583 

0'092086 

21 

■62243 

•48276 

■43382 

•37161 

•33368 

•28934 

•23713 

•17440 

•097342 

22 

•53603 

•49649 

■44746 

•38476 

•34619 

•30111 

■24759 

•18283 

•10267 

23 

•64889 

•60963 

•46049 

•39738 

•35836 

■31263 

■26781 

■19112 

■10776 

24 

■66108 

•62193 

•47294 

•40964 

•37012 

■32361 

•26778 

•19928 

■11290 

25 

0'57263 

0^83373 

0-48485 

0-42123 

0-38147 

0-33437 

0-27761 

0'20730 

O'llSOl 

26 

•58361 

•64498 

■49624 

•43248 

•39246 

■34481 

•28701 

■21618 

•12308 

27 

■69406 

•55671 

■60716 

•44333 

•40306 

•36496 

•29629 

•22293 

■mil 

28 

•60398 

•66698 

•81763 

■45378 

•41332 

•36479 

•30634 

•23054 

■13310 

29 

•61344 

•67674 

■62767 

■46386 

•42326 

•37436 

•31419 

•23803 

■13804 

30 

0'62247 

0^68611 

0-63731 

0-47369 

0-43286 

0-38366 

0'32283 

0'24539 

O' 14295 

40 

•69412 

■66034 

■01699 

•85476 

•61428 

•46386 

•39910 

•31243 

•18960 

60 

■77851 

■75104 

■71386 

•66029 

•62333 

•67646 

•51067 

•41760 

•27063 

120 

■87897 

•86198 

•83814 

■80192 

•77663 

•73946 

■68688 

•60235 

■44158 

00 

POOOOO 

POOOOO 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 


For vi= 00, a=0 




174 Percentage points of the incomplete beta-function 


Beta DKTBiBtrTioir: 6 pib cent Points bob x 


^S=2g 


V 

1 

2 

3 

4 

6 

6 

7 

8 

9 

1 

0-0061658 

0-0026000 

0-0015429 

0-0011119 

0-0386820 

0-0371170 

o-oseo300 

0-0“62300 

0-0346170 

2 

•097600 

•060000 

■033617 

•025321 

•020308 

•016952 

■014648 

■012741 

■011334 

3 

•22862 

■13572 

•097308 

■076010 

•P62412 

•062962 

•046007 

•040871 

•036447 

4 

•34163 

•22361 

•16825 

■13536 

•11338 

■097611 

•086728 

•076440 

•068979 

5 

0’43074 

0-30171 

0-23663 

0-19403 

0-16628 

0-14408 

0-12778 

0-11482 

0-10427 

6 

•60053 

■36840 

•29599 

■24860 

•21477 

■18926 

•16927 

‘15316 

■13989 

7 

•66593 

•42489 

•34929 

■29811 

■26063 

•23182 

•20890 

•19019 

■17461 

8 

■60071 

■47287 

•39607 

•34259 

■30260 

•27134 

•24613 

•22532 

•20783 

9 

■63761 

■61390 

•43716 

■38245 

•34080 

•30777 

•28082 

•25836 

■23930 

10 

0-66824 

0-64928 

0-47338 

0-41820 

0-37663 

0-34126 

0-31301 

0^28924 

0-26894 

11 

‘69425 

•58003 

•60646 

■45033 

•40712 

•37203 

•34283 

•31807 

■29677 

12 

•71654 

■60696 

•63402 

■47930 

•43690 

•40031 

•37044 

•34494 

•32286 

13 

•73683 

■63073 

•66958 

■50551 

•4-6219 

•42636 

•39604 

•37000 

•34732 

14 

■76268 

■68184 

■68266 

■62932 

•48626 

■46036 

•41980 

•39338 

■37026 

15 

0-76764 

0-67070 

0-80333 

0^66102 

0-50836 

0-47266 

0-441S7 

0-41621 

0-39176 

16 

■78072 

■68766 

•62217 

•67086 

•62872 

•49310 

■46242 

■43663 

■41196 

17 

■79249 

•70297 

■63933 

■68907 

•54760 

•51217 

■48168 

■46474 

■43094 

18 

■80307 

■71687 

■86503 

•60684 

•66490 

■52991 

■49949 

•47267 

■44880 

19 

■81263 

•72964 

■86944 

•62131 

•68103 

■64646 

■51624 

•48951 

■46564 

20 

0-82131 

0-74113 

0-68271 

0-63664 

0-69605 

0-66189 

0-63194 

0-50635 

0-48162 

21 

•82923 

•76178 

■69496 

•64894 

•61004 

■67635 

■54669 

■52027 

•49652 

22 

■83647 

•76160 

■70632 

•66132 

■62312 

■58990 

■56066 

•63434 

•61071 

23 

•84313 

■77067 

•71687 

•67287 

•63636 

•60263 

■57363 

•54764 

•62416 

24 

•84927 ” 

•77908 

■72669 

■68366 

•64684 

■61461 

•58596 

■66022 

•53689 

25 

0-86494 

0-78690 

O-73680 

0-69377 

0-66764 

0-62590 

0-69761 

0-57213 

0-64898 

26 

■86021 

•79418 

•74444 

•70327 

■66780. 

•63666 

■60864 

■68343 

•66048 

27 

■86611 

■80099 

■76249 

•71219 

•67738 

■64663 

•61909 

•69418 

•67141 

28 

■80967 

■80736 

•76004 

•72060 

•68643 

•66617 

•62900 

•60436 

•68183 

29 

■87394 

■81334 

■76715 

•72864 

•69499 

■66522 

•63842 

■61407 

■59177 

30 

0-87794 

0-81898 

0-77386 

0-73604 

0-70311 

0-67381 

0-64738 

0-62332 

0-60126 

40 

■90734 

■86089 

■82447 

■79327 

•76569 

■74053 

•71758 

■69636 

■67663 

60 

■93748 

•90497 

■87881 

•86591 

•83617 

•81606 

■79824 

■78160 

■76669 

120 

■96837 

■95130 

•93720 

■92468 

•91290 

•90102 

•89148 

■88160 

■87191 

CO 

1-00000 

1-00000 

1-00000 

1 ■00000 

i-00000 

J 

1-00000 

1-00000 

1-00000 

1-00000 


This table gives the values of x for which 4 (ji, ?)!=0-0S where 2 J = |'>' 3 , 
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Beta Distribution: 6 per cent Points rob x 


'' 1 =% ''>=%) 


u 

f'lX 

10 



B 

B 


40 

00 

120 

1 




mi 

0 - 0 n 6727 

0 - 0 n 3326 

0 - 0*99538 

0 - 0*66082 

0 - 0*32904 

2 



■0068158 


•0042653 

■0034137 

•0026614 

■0017083 

■ 0*86462 

3 



■022466 


■014264 

■011472 

■0086511 

■0057991 

■0029167 

i 

H9 


■043641 


•028063 

■022679 

■017191 

•011686 

■0068667 

5 



0^067312 


. 0-043994 

0-036747 

0-027240 

0’018468 

0-0093841 

6 

■12876 

•11111 

■092207 


•061103 

■049898 

■038224 

•026043 

■013317 

7 

■16142 


■11733 


•078783 

■064661 

■049781 

•034103 

■017640 

8 

■19290 

■16875 

■14216 

■11267 

•096658 

■079696 

■061675 

•042481 

■021976 

9 


•19618 

■16638 

•13288 

•11449 

■094827 

■073748 

•061068 

■028672 

10 

0'25137 

0^22244 

0-18984 


013211 

0-10991 

0-085885 

0-069785 

0-031288 

11 

■27823 

■24748 

■21244 

■17207 

■14943 

■12484 

■098008 

•068675 

■036094 

12 

■30364 

■27126 

■23413 


=16638 

■13956 

■11006 

•077394 

■040967 

13 

■32737 

■29383 

■26492 

■20908 

•18288 

■16401 

■12199 

■086209 

■046889 

14 

■34981 

•31524 

■27481 

•22669 

■19895 

•16818 . 

■13377 

■094994 

■050847 

16 

0'37096 

0^33654 

0-29382 

0-24370 

0-21467 

0-18203 

0-14539 

0-10373 

0-055827 

18 

■39086 

•35480 

■31199 


■22972 

■19566 

■15682 

■11240 

■060821 

17 

■40966 

• 37307 ' 

■32936 

■27694 

■24441 

■20877 

•19805 

■12099 

■066820 

18 

■42738 

•39041 

■34596 


■26866 

■22164 

■17908 

■12950 

•070818 

19 

■44414 

•40689 

•36183 

Bm9 

•27244 

•23418 

■18989 

■13791 

■076809 

20 

0^48999 

0-42256 

0-37701 


0-28680 

0-24639 

0-20060 

0-14622 

0-080789 

21 

■47501 

■43746 

■39164 

■33376 

•29874 

•26828 

■21088 

■16442 

■085763 

22 

■48926 

•45166 

■40644 


■31126 

•26986 

■22106 

■16262 

•090698 

23 

■50276 

■46618 

■41877 

■36964 

■32340 

•28112 

■23102 

■17051 

■096621 

24 

■61560 

■47808 

■43154 


•33618 

■29208 

■24078 

■17838 

•10062 

25 

0^52782 

0^49040 

0'44379 

0-38373 

0-34663 

0-30276 

0-25032 

0-18615 

0-10539 

26 

■63946 

■50217 

■46654 

•39616 

•36766 

■31314 

■26966 

■19379 

■11024 

27 

■65054 

■51343 

■46683 


■36826 

•32325 

•26880 

•20133 

•11605 

28 

•66112 

■62420 

■47768 

■41686 

■37862 

•33309 

■27776 

■20876 

•11983 

29 

■67122 

■63462 

■48812 

■42715 

■38867 

■34267 

■28660 

■21606 

•12468 

30 

0^68088 

O ^ 64442 

0-49816 


0-39842 

0-36200 

0'29607 

0-22328 

0-12930 

40 

•66819 

•62460 

■58083 

■82099 

•48176 

■43321 

■37136 

•28936 

■17463 

60 

■76070 

■72282 

■68536 

■63186 

•59622 

■54807 

■48477 

■39468 

•26410 

120 

■86268 

■84504 

■82047 

■78342 

•78661 

■72016 

■66738 

■58329 

■42519 

00 

POOOOO 

lOOOOO 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 

1-00000 


For vi= CO, *=0 
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Percentage points of the incomplete beta-function 


Beta. Disteibutiqn: 2-6 peb cent Poikts poe x 

Vt = ip 




0'0“2l691 0-0n7782 0-046064 O'OnSOes 0'0ni683 
•010076 •0084038 •0072076 -0063096 -OOSeiOi 

•038748 -032820 '028471 -025143 •022613 

•078706 -067586 •059243 ■062746 ■047639 


•16696 'Uees 


•24933 ’22278 ■20161 •18406 •16944 

•28642 •26774 •23460 •21623 -lOSg? 


0'4781S I0'40866 I0'36877 I0’32071 0-29042 

•36234 -32086 

•38149 ‘34914 

•40838 •37646 

•43321 -SSOOl 

0'61149 I0'64628 |0’49641 I0-45618 0’42268 

•47748 '44390 

•49723 ■46372 

•51661 -48224 

•63276 •49969 


0-64877 0’61686 

•66375 ’SSI 16 

•67780 •84853 

•69100 'eegos 

•60341 ■67187 


0-61611 0'68396 

•62616 ’Sg^lO 

•67119 -63668 -60624 

■64646 -61662 

•66882 -62630 

0-66471 0’93669 

•73369 '70839 

•81166 .79193 

•89976 .88828 

1-00000 1-00000 I'OOOOO 


0‘26561 

•29482 

•32219 

•34779 

•37176 

0’39418 

•41620 

■43490 

■48341 

•47081 

0'48719 

•60263 

•61720 

•63098 

•54401 

0'66636 

•66808 

•67922 


0'60948 

•68532 

•77372 

•87743 

I'OOOOO 


0'24486 

•27288 

•29930 

•32416 

•34766 

0-36965 

•39026 

•40976 

•42814 

•44649 

0'46187 

•47736 

•49202 

■50692 

•51911 

0'63163 

•64364 

•66488 

■56568 

•67699 

0'68682 

•66411 

■75668 

■86708 

I'OOOOO 


, 0.22722 
■26409 
■27957 
•30368 
•32646 

|0'34799 

•36833 

•38766 

•40676 

•42297 

0'43928 

•45476 

■46943 

•48338 

•49664 

0'50927 

•82130 

•63278 

•84373 

•86420 

0'66421 

•64446 

•74066 

■86717 

1-00000 


This table gives the values of » for which Z, {p, g') = 0-026 where 
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Beta DiSTRiBUTiOfr : 2'6 pee cent Points eoe a; 


h=^S >’!=2p 





10 

12 

16 

20 

24 

30 

40 

60 

120 

1 

0-0n0323 

0'0‘86313 

0-0*67686 



0-0*33286 

iHi 



2 

•0060508 

•0042107 

•0033700 



•0016864 


•0384367 

■0342187 

3 

•020382 

•017139 

■013838 



•0070619 




4 

■043272 

■036693 

■029886 



•016614 



U|H| 

6 

0’070233 

0^060028 

0-049302 

0-038002 


0-026068 

iljl 



6 

■098988 

■086233 

•070663 

•064861 


•037986 

B'|!9 


jn !! H 

7 

■12818 

■11113 

■092696 

mi 


•060772 

B !!I9 

RM! 1 

B ! I9 

8 

•15701 

•13700 

•11808 

hH 


•064092 

B'l!9 


B ' 19 

fl 

■18604 

■10^40 

•13732 



•077712 

lliill 

B 1 

B III 

10 

0^2120I 

0^18709 

0-15917 



0-091466 

0-071319 

0-049628 

0-026864 

11 

■23780 

•21091 

■18048 

■14605 

•12623 

•10523 



K 1 

12 

■26238 

•23379 

•20116 



•11893 



Bl ^1 

13 

•28573 

•25671 

•22112 

•18067 

•15770 

•13249 

818109 

Ry lH 

K '1*1 

14 

•30790 

•27667 

•24039 

•19763 

•17299 

•14688 

•11673 

Bil 


IS 

0^32893 

0^29668 

0'26893 

0-21392 

[SilW 


0-12669 

0-090116 

0'048336 

. 16 

■34888 

•31678 

•27676 

■22983 


•17198 


■njmim 


17 

•36779 

■33400 

■29389 


•21674 

■18466 

•14823 


^^9 

18 

•38674 

■38138 

■31034 



■19708 

•16878 

•11444 

9 

19 

•40278 

•36797 

•32614 

•27405 


•20022 

•16916 

•12244 

BB|| 

20 

0^41896 

0'38380 

0'34132 

0'28864 

0-25713 


0- 17938 

0-13038 

0-07 1749 

21 

■43435 

■39893 

•36689 

•30218 

•26986 



•13823 


22 

•44900 

•41338 

■36990 

•31628 

•28221 

•24402 



B |II9 

23 

■46294 

•42720 

■38836 

•32796 






24 

•47623 

■44042 

•39629 



■26687 


bh 

B||j| 

25 

0^48891 

0'45307 

0^40874 



0-27640 

0-22783 

0-16881 

0-095166, 

29 

•50101 

•46620 

■42071 


•32821 

•28667 


•17622 


27 

•61267 

•47682 

■43223 

■37466 

•33890 

•29669 


•18364 

BfiinI 

28 



•44334 


•34928 

■30647 

•28476 


BfigiiB 


•63421 

•49867 


■39684 

•36937 


•26339 

•19789 

•11368 

30 

0^64436 

O-6O805 

0'46434 

0-40594 

0-36918 


0-27186 

0'20492 

0-118I2 

40 

•62616 

•69296 

■649.99 

•49168 


•40697 

•34780 


•16201 

60 

■72560 

■69743 

■65992 

•60674 



•46239 

■37498 



■84764 

■82964 


•76678 



BMOIrH 

■56668 

Bmpm 



1-00000 

I'OOOOO 

1-00000 



nil 

1-00000 

l-OOOOO 


For i'i= 00, a5=0 
























178 Percentage, points of the incomplete beta-function 


Beta Disteibptioit: 1 peb cent Points bob x 


V^=2q 


\ 

1 

2 

3 

4 

6 

6 

7 

8 

9 

1 

0-0''24672 

o-onoooo 

0-0‘61686 

0-0*44446 

0-0*34699 

0-0*28446 

0-0«24097 

0-0*30897 

0-0*18449 

2 

•019900 

■010000 

■0066778 

•0060126 

•0040121 

-0033445 

•0028674 

•0026094 

•0022309 

3 

•080827 

•046416 

•032834 

■026468 

■020807 

•017699 

•016262 

•013468 

•012043 

4 

■16878 

•10000 

•073960 

•058903 

•049014 

•041999 

•038754 

•032682 

■029426 

6 

0-23520 

0'15849 

0-12142 

0-098877 

0-083663 

0-072429 

0-063948 

0-067264 

0-061867 

6 

•30387 

•21544 

■16979 

•14087 

•12065 

•10564 

■094014 

■084730 

■077130 

7 

•30370 

•26827 

■21636 

•18230 

•15801 

•13969 

■12011 

•11341 

•10375 

8 

•41540 

•31623 

■25997 

•22207 

•19437 

■17307 

•15612 

•14227 

■13073 

9 

■40009 

•36938 

•30024 

•26946 

•22910 

•20543 

•18637 

•17066 

■16746 

10 

0-49889 

0'39811 

0-33719 

0-29431 

0-26191 

0-23632 

0-21661 

0-19820 

0- 18355 

11 

•63279 

•‘13288 

•37099 

•32667 

•29271 

•26660 

•24335 

•22469 

■20879 

12 

•60268 

•46416 

•40191 

•35064 

•32163 

•20323 

•26981 

■25003 

•23307 

13 

•68893 

•49239 

•43020 

■38437 

•34846 

■31924 

■29487 

■27417 

■25631 

■ 14 

•61238 

■51795 

•45616 

•41006 

•37358 

■34369 

■31858 

■29712 

■27851 

16 

0-63336 

0'64117 

0-47999 

0-43387 

0-39706 

0-36066 

0-34098 

0’31891 

O^29908 

16 

■65224 

•56234 

•50194 

■45697 

•41899 

•38826 

•36214 

•33958 

•31986 

17 

•66930 

•58171 

•62219 

•47051 

•43951 

•40857 

•38213 

•36920 

■33906 

18 

■68479 

•59948 

•54094 

•49666 

•46872 

•42768 

•40103 

■37781 

•36733 

19 

■69892 

•61586 

•56832 

■61360 

•47674 

•44668 

■41890 

•39547 

•37474 

20 

0-71185 

0'63096 

0-57447 

0-63018 

0-49366 

O-46206 

0-43581 

0-41 224 

0-39131 

21 

■72372 

■8449,5 

•68962 

■64581 

•80958 

■47868 

■46184 

■42818 

■40711 

22 

•73467 

■66793 

•60367 

•66040 

•62460 

■49383 

■46703 

■44333 

•42217 

23 

•74479 

■67002 

•61071 

•67422 

•53869 

•60816 

•48144 

•46776 

•43663 

24 

■76417 

■68129 

■62903 

•68717 

•65204 

•52174 

•49614 

•47149 

•45026 

26 

0'76290 

0-60183 

O-04O69 

0-59938 

0-56466 

0-63461 

0-50816 

0-48458 

0-46336 

26 

•77103 

•70170 

•66147 

■61090 

•57660 

•54683 

•52056 

•49700 

•47687 

27 

•77862 

•71097 

•66172 

•62180 

•58793 

■66845 

•63236 

■60899 

■48786 

28 

•78673 

•71969 

•67139 

■63211 

-69868 

■56961 

■64362 

•52038 

•49932 

29 

•79240 

•72790 

■08064 

•64188 

■00890 

•58004 

■65437 

•63127 

•61031 

30 

0-79867 

0-73584 

0-68919 

O-06n6 

O-01862 

0'69008 

0-60464 

0'64170 

0-62085 

40 

•84641 

•79433 

■75561 

•72316 

•69482 

•66950 

■04606 

■62566 

•60617 

60 

■89449 

•86770 

■82898 

•80433 

•78233 

■76227 

•74376 

•72661 

•71034 

120 

•94599 

■92612 

■91014 

■89607 

•88321 

■87124 

•85996 

•84924 

•83900 

00 

1-00000 

1-00000 

1-00000 

1-00000 

l-OOOOO 

1-00000 

l-OOOOO 

1-00000 

1-00000 


Thia table gives the values of m for which 4 (p, g) = O'Ol where p = ivj , ? = ivi . 
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Beta Distribution: 1 per cent Points for * 


>'i=2g Vi=;2p 


v 

I'd \ 

10 

12 

16 

20 

24 

30 

40 

60 

120 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 
10 

17 

18 

19 

20 
21 
22 

23 

24 

26 

26 

27 

28 

29 

30 
40 
60 

120 

CO 

0'0n6513 

■0020080 

■010898 

•026763 

0^047389 

■070804 

•095627 

•12096 

•14619 

0-17097 

■19606 

•21834 

•24073 

•26220 

0'28276 

■30240 

■32117 

■33910 

■36622 

0-37267 

■38818 

■40311 

■41738 

■43103 

0- 44410 
■45661 
■46861 
■48011 
•49116 

0^60176 

■58819 

•69511 

■82918 

1- 00000 

0-Qn3647 

■0016737 

■0091569 

■022665 

0040434 

■060840 

■082714 

■10526 

■12796 

0-16044 

■17260 

■19398 

•21479 

•23489 

0-26426 

•27289 

•29079 

•30797 

•32440 

0-34029 

•36648 

•37006 

■38405 

■39749 

0- 41040 
■42280 
■43473 
■44621 
•46726 

0'46789 

■65673 

■86701 

■81062 

1- 00000 

0-0n0827 

■0013391 

•0073877 

■018436 

0-033149 
■060258 
•088820 
•088177 
•10787 , 

0- 12760 
•14713 
•16633 
■18511 
•20338 

0-22113 

■23833 

■26497 

■27106 

■28668 

0'30157 

■31603 

•32999 

•34346 

•36646 

0-36899 

■38109 

•39278 

■40407 

■41497 

0- 42662 
•61398 
■62969 
•78497 

1- 00000 

0-0'80631 

•0010046 

‘0056887 

•014065 

0-026603 

•038982 

•063801 

■069466 

•085684 

0-10193 

•11830 

•13468 

■15066 

•10646 

O-18190 

•19711 

•21189 

•22630 

•24032 

0'25395 

■20721 

■28008 

■29258 

■30472 

O-31051 

•32795 

■33906 

■34085 

•36032 

0- 37049 
■46778 
■67717 
■74677 

1- 00000 

0-0'66831 

•0^83718 

•0046777 

•011824 

0-021634 

•033067 

•045816 

•059390 

•073472 

0-087838 

•10232 

•11681 

•13120 

•14544 

0-16948 

■17327 

■18681 

■20005 

■21301 

0-22687 

■23803 

■26008 

•26184 

■27329 

0-28446 

■29634 

•30694 

•31626 

■32632 

0-33612 

■42144 

•64167 

■71942 

HOOOOO 

0-0»63242 

■0’66980 

■0037688 

■0096436 

0-017469 

•026923 

•037481 

■048797 

•060623 

0-072776 

■085117 

■097542 

•10997 

■12235 

0-13462 

■14678 

•16873 

■17053 

■18212 

0-19351 

■20468 

■21663 

■22636 

■23687 

0-24710 

■26722 

■26707 

■27670 

■28612 

0- 29634 
■37700 
•49647 
•68269 

1- 00000 

0-0*39766 

■0»50239 

■0028317 

■0072226 

0-013275 

■020667 

•028767 

■037626 

■046956 

0-056621 

■066612 

■076547 

■086660 

■096802 

0-10693 

■11702 

■12704 

■13697 

•14680 

0-16651 

■16609 

■17554 

■18486 

■19403 

0-20305 

■21193 

■22066 

■22926 

•23768 

0- 24697 
■32111 
■43656 
•62988 

1- 00000 

0-0=26400 

■0=33496 

■0018904 

■0048696 

0'0089747 

•013973 

■019640 

■025815 

■032376 

0-039229 

■046303 

•053541 

■060897 

■068334 

0-076824 

■083341 

•090806 

■098383 

■10588 

011334 

■12070 

•12812 

■13543 

■14268 

0-14986 

■16697 

•16401 

■17096 

•17785 

0- 18466 
■24819 
•35268 
■64709 

1- 00000 

0-0=13146 

■0=16749 

■0=96262 

■0024626 

0'0046620 

■0071236 

■010066 

•013300 

■016768 

0-020426 

•024237 

■028173 

■032212 

•036336 

0-040526 

■044772 

■049062 

■063386 

■067738 

0-062109 

■066494 

■070888 

■075286 

•079683 

0-084077 

•088466 

■092843 

•097210 

■10156 

0- 10590 
■14811 
•22459 
•39479 

1- 00000 


For yj= CO, ®=0 



180 Pmmtage points of ihe incomplete leta-fmction 


Beta Distbibution: 0-6 per cent Points fob x 
h’=^ I',= 2j5 


12 


13 

'63337 

14 

■66866 

16 

0'68144 

16 

•60200 

17 

•62080 

18 

■63789 

19 


20 


21 

BlBH 

22 

‘69341 

23 

•70477 

24 

•71632 

26 

0-72616 

26 

■73434 

27 

•74294 

28 


29 

•76867 



120 -esoio •91648 '39893 -88442 -87120 '80892 '84739. '836 

00 I'OOOOO I'OOOOO I'OOOOO I'OOOOO 1-00000 I'OOOOO 1-00000 I'OOC 

This table gives the values of x for which I, (p, j)=0'006 where p=h.,q=:i 
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Beta Disteibhtion: 0-6 per cent Points foe x 


h=k j.,=2}) 


V|\ 

10 

12 

16 

20 

24 

30 

40 

B 

120 

1 

0'0'41280 

00'34116 

0-0S27067 

0-0»20132 

0-0'16707 

0-0513310 

llil 


0-0*32862 

2 

■0010020 

•0283507 

■0366812 

■flW113 

•0“41762 

•0»334H 




3 

■0068204 

•0067290 

■0046206 


•0029242 

■0023493 




4 

■018721 

■016844 

•012879 


•0082522 





6 

0'0354I5 

0-030191 

0'024729 

0019006 

0-016040 



RH 


8 

•066299 

■047464 

•039162 

•030337 

•026709 

•020924 


n 


7 

•077090 

■066693 

•066329 

•04'3189 

•038749 



B 


8 

■099867 

•088787 

•072888 

•057076 

•048760 


•030828 

B' 1^9 


9 





•061433 

IBI 


BIB 

•013963 

m 

0^14606 

0-12831 

0-10862 

O-O80595 

0-074540 

0-061684 

0-047930 

0-033162 

0-017241 

11 

■16876 


•12683 

mam 

•087903 



■mia 


12 

■19092 

■16931 

•14489 

■Bl 

•10139 


•066252 

■046266 


13 

■21242 

•18919 

•16270 

HI 

•11490 


•076662 

Im 


14 

•23320 




•12836 

■m 


Hy 


16 



0-19726 


0-14170 


0-094697 

0-067019 

0-036743 

16 

■27248 

•24643 

•21388 

•17644 

•16488 





17 





■16787 

•14241 

•11377 


•043741 

18 

•30872 

■27986 

•24676 


•18065 

•15373 

■12324 


•047816 

19 

•32674 

■29616 


•21829 

■19319 

•16489 

■13266 



20 

0-34208 

0-31184 



0-20649 


IRQ 



21 


•32696 


•24458 

;21763 

•18673 




22 

■37289 

•34151 

‘30387 

•26723 

•22932 

■19738 


•11676 


23 


•38852 

•31728 

•26964 

■24084 


•16938 


•068619 

24 

BB 

•36901 


•28153 

•25210 

•21811 

•17820 

1^9 

mmm 

26 


0-38200 

0-34272 

0'29320 

0-26309 


IQ 



26 

•42682 

■39452 

■35484 

KUHjtH 

•27383 



■14461 


27 



•36667 


•28432 

•24776 


•15143 

•085469 

28 


■41821 

■37792 


•29468 

•25724 

•21266 

■16819 


29 

B9 

■42942 

■38891 

•33681 

•30464 

•26664 


•16489 


m 





0-31429 

0'27666 




40 


■63024 


•43493 

•39980 

■36700 




11^ 

•67384 

■64684 


•65688 

•62194 

•47762 

•41913 

■33769 

■21421 




•77126 

Bl 

■70631 

•66846 

MM 


IBHI 

m 




■ 

1-00000 

1-00000 

■ 

■1 

inn 


For vi= oo, !»=0 
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TABLE OF LAGRANGIAN COEFFICIENTS FOE, HARMONIC 
INTERPOLATION IN CERTAIN TABLES OF 
PERCENTAGE POINTS 

Prepared by L. J. COMRIE asd H. 0. HARTLEY 


This table was prepared to facilitate interpolation in the tables of percentage points of the 
Incomplete Beta-F\mction in. the preceding paper. As, however, the -mHin part of the table 
may be applied to interpolation in other tables with a similar lay-out, it is published 
separately. 

The table is based on harmonic interpolation — a device introduced by R. A, Fisher in 
his tables of percentage points of the distribution of z, whose two parameters (degrees of 
freedom) Wj and range from 1 to oo. It consists in using values of and in harmonic 
progression, so that, with l/nj and 1/n.j as variables, z is tabulated at equidistant intervals 
near l/n-i = 0 and also near l/n^ = 0. This transformation renders the z-tahle (apart from 
its singularity at Wj = oo, », = oo) interpolable. As the percentage points of the Incomplete 
Beta-Function show a similar behaviour,* the values of and near the margin of the 
tables have been chosen in harmonic progression. Harmonic interpolation is, in fact, 
applicable to any table of percentage points (depending on a parameter n with an infinite 
range) in which the statistic can be adequately represented as a polynomial in l/n.f It is 

Table of LagraTigian Coefficients 

Column headings are the argnmonts of tabular values and 
row headings the arguments of the interpolate. 





Ordinary 





7 

8 

9 

10 

12 

16 

20 



•f" 

- 

-t 

+ 

- 

. + 

11 

0069 231 

0-428 671 

1-090 909 

1-440 000 

0-300 000 

0-008 671 

0-000 140 


8 

9 

10 

12 

16 

20 

24 





+ 

+ 


+ 

13 

0-171 876 

0-777 778 

1-100 000 

1-336 806 

0-162 963 

0-006 260 

0-000 679 

14 

0-223 214 

0-969 697 

1-286 714 

1-041 667 

0-607 936 

O-OLl 364 

0-000 992 


9 

10 

12 

15 

20 

24 

30 


_ 

+ 


+ 

H' 

- 

-1- 

16 

0-172 391 

0-448 000 

0-604 938 

1-238 914 

0-106909 

0-017 284 

0-000 790 

17 

0-306 397 

0-780 000 

0-983 026 

1-268 272 

0-289 646 

0-040 124 

0-001 728 

18 

0-332 468 

0-833 143 

1-000 OOO 

1-024000 

0-630 182 

0-067 143 

0-002 286 

19 

0-222 222 

0-660 000 

0-636 674 

0-670 370 

0-787 600 

0-060926 

0-001 862 




Harmonic 





10 

12 

16 

20 

24 

30 

40 



— 

"h 

+ 

- 

-h 

•- 

16 

0-003 062 

0-039 661 

0-692 139 

0-769 043 

0-632 813 

0-247 192 

0-039 062 

17 

0-003 097 

0-037 469 

0-409 711 

1-213 968 

0-866 212 

0-315 162 

0-048 267 

18 

0-001 096 

0-022 993 

0-201 189 

1-341 269 

0-736 777 

0-261 486 

0-037 160 

19 

0-000 818 

0-009 091 

0-069 601 

1-237 346 

0-407 263 

0-126 647 

0-017 957 


* This issue, pp. 168-81. 

■j- It can bo shown that any so-called “studentized” statistic has this property. 





21 

22 

23 


25 

26 

27 

28 
29 


31 (15-6) 

32 (16'0) 

33 {16'6) 
34(17'0) 

36 (17:6) 
36 (18-0) 
37(18'6) 
3 S (19-0) 
39 (19-6) 


41 {20-6) 
42(21'0) 

43 (21 '6) 

44 (22-0) 

46 {22-6) 

46 (23-0) 

47 (23-6) 

48 (24-0) 

49 (24'6) 

60 (26-0) 
61 (26-6) 
62 (26-0) 
53 (26-6) 
64 (27-0) 

66 (27-6) 
66 (28-0) 

67 (28-6) 

68 (29-0) 

69 (29-6) 


61 (30'6) 

62 (31-0) 

63 (31-6) 

64 (32-0) 

66 {32-6) 

66 (33-0) 

67 (33-6) 

68 (34-0): 

69 (34-6) 


Table of Lagfangian Coefficients ( continued ) 


Harmonic 


12 

0-000 600 
0-000 579 
0-000 306 

16 

-t- 

0-000 730 
0 000 993 
0-000 927 
0-000 670 
0-000 336 

20 

( 10 ) 

4 - 

0-003 664 
0-006 876 
0-006 876 
0-006 948 

0-006 368 
0-005 336 
0-004 062 
0-002 684 
0-001 306 


0-001 182 
0-002 210 
0-003 069 
0-003 764 

0-004 268 
0-004 619 
0-004 819 
0-004 883 
0-004 824 

0-004 669 
0-004 402 
0-004 068 
0-003 669 
0-003 220 

0-002 730 
0-002 210 
O-OOJ 669 
0-001 116 
0-000 668 

+ 

0-000 652 
0-001 093 
0-001 619 
0-002 128 

0-002 616 
0-003 082 
0-003 624 
0-003 940 
0-004 329 


16 

0-010 497 
0-009 653 
0-004 908 

20 

0-040 868 
0-050 984 
0-044 488 
0-030 498 
0-014 607 

24 

( 12 ) 

0-041 469 
0-063 446 
0-071 603 
0-070 033 

0-062 423 
0-061 212 
0-038 260 
0-024 848 
0-011 904 


0-010 611 
0-019 448 
0-026 748 
0-032 432 

0-036 680 
0-039 302 
0-040 733 
0-041 016 
0-040 293 

0-038 707 
0 036 392 
0-033 473 
0-030 006 
0-026 273 

0-022 191 
0-017 901 
0-013 477 
0-008 983 
0-004476 


0-004 402 
0-008 696 
0-012 864 
o-xne 863 

0-020 675 
0-024 304 
0-027 730 
0-030 945 
0-033 942 


20 

-4 

0-629 840 
0-337 838 
0-130 868 

24 

■h 

0-817 162 
0-611 813 
0-416 220 
0-243 980 
0-108 172 

30 

(16) 

■{- 

0-906 909 
0-763 076 
0-670 341 
0-547 129 

0-420 158 
0-320 073 
0-221 984 
0-136 887 
0-061 908 


0-060 764 
0-091 161 
0-122 164 
0-144 787 

0-160 037 
0-168 878 
0-172 219 
0-170 899 
0-166 679 

0-167 248 
0 146 218 
0-133 132 
0-118 464 
0-102 630 

0-085 989 
0-008 849 
0-061 474 
0-034 088 
0-016 878 

4 - 

0-016 417 
0 032 268 
0-0 i 7 471 
0-061 959 

0-076 683 
0-088 608 
0-100 708 
0-111 971 
0-122 338 


24 

+ 

0-637 463 

0- 864 864 

1- 006 008 

30 

+ 

0-306 432 
0-673 675 
0-778 637 
0-914 926 
0-086 984 

40 

( 20 ) 

+ 

0-179 142 
0-352478 
0-610 736 
0-648 460 

0-762 947 
0-863 629 
0-920 823 
0-966 308 
0-901 060 

+ 

0-992 724 
0-072 384 
0-941 117 
0-900 900 

0-853 629 
0-800 606 
0-743 649 
0-683 894 
0-621 808 

0-669 104 
0-496 256 
0-433 910 
0-372 604 
0-312 777 

0-264 781 
0-198 807 
0-146 339 
0-094 268 
0-046 797 


0-043 083 
0-083 441 
0-121 086 
0-166 046 

0-188 368 
0-218 113 
0-246 348 
0-270 161 
0-292 606 


30 

0-209 947 
0-263 378 
0-168 269 

40 

0-108 964 
0-174 804 
0-101640 
0-162 663 
0-096 611 

60 

(30) 

0-062 546 
0-113 297 
0-148 964 
0-1 68 348 

0-171 663 
0-160 037 
0-136 121 
0-098 827 
0-063 141 

+ 

0-068 780 
0-121 548 
0-186 839 
0-263 378 

0-320 073 
0-386 006 
0-460 419 
0-612 696 
0-672 340 

0-628 992 
0-682 361 
0-732 223 
0-778 476 
0-821 038 

0-869 880 
0-896’ 036 
0-026 636 
0-964 463 

0- 978 914 

•f* 

1- 017 846 
1-032 686 
1-044 367 
1-063 303 

1-069 670 
1-063 300 
1-064 636 
1-063 721 
1-060 092 


40 

■i- 

0-060 616 
0-088 640 
0-042 230 

60 

+ 

0-020 184 
0-044 980 
0-047 184 
0-038 122 
0-021 204 

120 

(60) 

+ 

0-016 304 
0-028 839 
0-036 984 
0-040 717 

0-040 301 
0-036 680 
0-029 965 
0-021 212 
0-011 032 


0-011 310 
0-022 440 
0-033 000 
0-042 674 

0-061 212 
0-068 422 
0-064 169 
0-068 369 
0-070 939 

0-071 886 
0-071 202 
0-068 918 
0-066 067 
0-060 712 

0-062 916 
0-044 762 
0-036 297 
0-024 631 
0-012 838 

+ 

0-013 801 
0-028 485 
0-043 973 
0-060 189 

0-077 060 
0-094 616 
0-112 490 
0-130 919 
0-149 746 


60 

0-008 076 
0-008 890 
0-006 306 

120 

0-003 686 
0-006 670 
0-006 740 
9-004 646 
0-002 477 


00 

(cO) 


0-002 

016 

0-003 

626 

0-004 

469 

0-004 

883 

0-004 

768 

0-004 

268 

0-003 

463 

0-002 

416 

0-001 

240 

‘H 

0-001 

241 

0-002 

431 

0-003 

529 

0-004 

605 

0-006 

336 

0-006 

006 

0-006 

606 

0-006 

836 

0-000 

096 

0-006 

989 

0-006 

824 

0-006 

509 

0-006 

066 

0-005 

474 

0-004 

777 

0-003 

978 

0-003 

088 

0-002 

131 

0-001 

088 

0-001 

131 

0-002 

296 

0-003 

481 

0-004 

681 

0-006 

886 

0-007 

089 

0-008 

281 

0-009 

456 

O-OIO 

607 


Table of Lagra/ngum, Goeffadents (continued) 


Hannonio 



20 

24 

30 

40 

60 

120 

00 


(10) 

■+• 

(12) 

(16) 

(20) 

(30) 

(60) 

(to) 

70 (36-0) 

0.004 692 

0-036 719 

0-131 960 

0-312 796 

+ 

1-056 683 

+ 

0‘168 909 

0-011 730 

71 (36-6) 

0.006 027 

0-039 276 

0-140 696 

0-330 812. 

1-048 823 

0-188 360 

0-012 819 

72 (36 0) 

0.006 336 

0-041 610 

0-148 006 

0-346 746 

1-040238 

0-208 048 

0-013 870 

73 (36'6) 

0 006 616 

0-043 726 

0-166 706 

0-360 690 

1-030 048 

0-227 926 

0-014 878 

74 (37.0) 

0-006 867 

0-046 623 

0-182 013 

0-372 737 

1-018 370 

0-247 961 

0-016 841 

76 (87-6) 

0.006 093 

0-047. 309 

0-167 662 

0-382 976 

1-006 312 

0-208 083 

0-016 766 

76 (38.0) 

0-006 292 

0-048 787 

0-172 346 

0-391 499 

0-990 981 

0-288 286 

0-017 6I7 

77 (38-6) 

0-006 465 

0-050 062 

0-176 416 

0-398 393 

0-976 477 ■ 

0-308 623 

0-018 426 

78 (39-0) 

0 006 613 

0-061 141 

0-179 793 

0-403 746 

0-968 894 

0-328 764 

0-019 178 

79 (39-6) 

0-006 736 

0-062 030 

0-182 602 

0-407 639 

0-941 324 

0-348 979 

0-019 872 

80 (40.0) 

0-006 836 

0-062 734 

0-184.670 

0-410 166 

0'-922 861 

0-369 141 

0-020 608 

81 (40-6) 

0-006.912 

0-063 282 

0-186 027 

0-411 376 

0-903 667 

0-389226 

0-021 083 

82 (41-0) 

0-006 967 

0-063 621 

0-186 898 

0-411 373 

0-883 618 

0-409 208 

0-021 597 

83 (41-6) 

0-007 000 

0-063 816 

0-187 212 

0-410 223 

0-862 806 

0-429 071 

0-022 049 

84 (42-0) 

0 007 012 

0 063 866 

0-186 997 

0-407 993 

0-841 486 

0-448 793 

0-022 440 

86 (42.6) 

0-007 006 

0 063 746 

0-186 279 

0-404 763 

0-819 626 

0-468 367 

0-022 767 

86(43.0) 

0-006 980 

0-063 496 

0-186 083 

0-400 667 

0-797 283 

0-487 749 

0-023 033 

87 (43.6) 

0-006 936 

0-063 110 

0-183 437 

0-396 496 

0-774 614 

0-606 964 

0-023 236 

88 (44.0) 

0-006 876 

0-062 698 

0-181 366 

0-389 600 

0-761 S'?! 

0-626 960 

0-023 376 

89 (44.5) 

0-006 798 

.0-061 961 

0-178 892 

0-382 934 

0-727 903 

0-544 766 

0-023 466 

90 (46.0) 

0-006 706 

0-061 212 

0-176 040 

0-376 662 

0-704 161 

0-663 329 

0-023 472 ■ 

91 (46.6) 

0 000 600 

0-060 364 

0-172833 

0-387 606 

0-680 182 

0-681 673 

0-023 428 

92 (46 0) 

0-006 479 

0-049 393 

0-169 293 

0-368 843 

0-666 009 

0-609 780 

0-023 326 

93 (46.6) 

0-006 346 

0-048 337 

0-166 440 

0-349 609 

0-631 680 

0-617 642 

0-023 162 

04 (47.0) 

0-006 200 

0-047 190 

0-161 296 

0-339 848 

‘0-607 220 

0-636 254 

, 0-022 940 

96 (47.6) 

0-006 043 

0-046 969 

0-166 878 

0-329 602 

0-682 689 

0-662 011 

0-022 660 

96 (48.0) 

0-006 876 

0-044 647 

0-162 207 

0-318 909 

0-668 090 

0-669 708 

0-022 324 

97 (48.6) 

0-006 696 

0-043 262 

0-147 299 

0-307 806 

0-633 462 

0-686 642 

0-021 931 

98 (49.O) 

0-006 609 

0-041 806 

0-142 173 

0-296 330 

0-608 829 

0-703 109 

0-021 484 

99 (49-6) 

0-006 312 

0-040 287 

0-136 844 

0-284 611 

0-484 217 

0‘719 408 

0-020 983 

100 (60.0) 

0-006 107 

0-038 707 

0-131 328 

0-272 384 

0-469 648 

0-736 437 

0-020 429 

101 (60.6) 

0 004 894 

0-037 072 

0-126 640 

0-269 976 

0-436 143 

0-751 194 

0-019 823 

102 (81.0) 

0-004 676 

0-036 386 

0-119 793 

0-247 316 

0-410 721 

0-766 679 

0-019 167 

103 (61.6) 

0-004 448 

0 033 661 

0-113 802 

0-234 429 

'0-386 400 

0-781 891 

0-018 461 

104 (62.0) 

0-004 216 

0-031 873 

0-107 680 

0-221 342 

0-362 196 

0-796 831 

0-017 708 

106 (62.6) 

0-003 978 

0-030 066 

0-101 437 

0-208 077 

0-338 126 

0-811 499 

0-016 906 

106(63.0) 

0-003 736 

0-028 201 

0-096 087 

0-194 666 

0-314 199 

0-826 895 

0-016 069 

107 (63.6) 

0-003 486 

0-026 314 

0-088 039 

0-181 099 

0-290 433 

0-840 022 

0-016 167 

108 (64-0) 

0-003 234 

0-024 307 

0-082 104 

0-167 427 

0-266 837 

0-863 880 

0-014 231 

109 (64.6) 

0-002 978 

0-022 462 

0-076 492 

0-163 068 

0-243 423 

0-867 470 

0-013 263 

110 (66.0) 

0-002 719 

0-020 484 

0-068 812 

0-139 809 

0-220 199 

0-880 796 

0-012 233 

111 (66.6) 

0-002 466 

0-018 494 

0-062 074 

0-126 896 

0-197 176 

0-893 868 

0-011 173 

112 (68.0) 

0-002 190 

0-016 485 

0-066 284 

0-111 933 

0-174 368 

0-906 660 

0-010 074 

113 (66 6) 

0-001 922 

0-014 469 

0-048 462 

0-097 930 

0161 766 

0-919 203 

0-008 937 

114 (67.0) 

0-001 661 

0-012 420 

0-041 684 

0-083 918 

0-129 374 

0-931 491 

0-007 762 

116 (67.6) 

0-001 379 

0-010 368 

0-034 688 

0-069 891 

0-107 219 

0-943 626 

0-006 562 

116 (68.0) 

0-001 106 

0-008 307 

0-027 770 

0-066 866 

0-086 296 

0-956 309 

0-005 307 

177 (68.6) 

0-000 831 

0-006 238 

0-020 837 

0-041 864 

0-063 808 

0-966 846 

0-004 029 

118 (60.0) 

0-000 664 

0-004 162 

0-013 894 

0-027 867 

0-042 161 

0-978 137 

0-002 717 

119 (60.6) 

0-000 278 

0-002 083 

0-006 947 

0-013 913 

0-020 967 

0-989 188 

0-001 374 
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186 Lagrangian coefficients for harmomc interpolation 

particularly convonient if the high-order terms are so small that linear interpolation 
suffices. Unfortunately, however, if the interpolate is required to the same accuracy as the 
tabular' values, linear interpolation in the above-mentioned tables is inadequate; hence this 
table has been prepared as the simplest method of preserving tabular accuracy in the 
interpolates. The interpolate is the sum of seven products of which the seven (Lagrangian) 
multipliers are taken from this table whilst the multiplicands are tabular entries in 
the table of percentage points. The examples below illustrate the use of the table. 

The calculation of the Lagrangian coefficients follows the standard formulae for inter- 
polation by a polynomial of the sixth degree. Where interpolation is harmonic the coeffl. 
cients are those for polynomials of the sixth degree in the reciprocal of the parameters 
used as argument. 

In the early part of the table, however, ordinary Lagrangian coefficients are given, 
since the polynomial in the parameter itself is preferable in this range. The part of the table 
with row headings less than 30 has been specifically designed to meet the requirements of 
the tables of percentage points of the Incomplete Beta Function. In particular it will he 
noted that there are two rows for 16, 17, 18 and 19, one giving harmonic and the other 
ordinary Lagrangian coefficients ; the application of both rows affords a good check, as 
will be seen from Example 2 on p. 162 in the preceding paper. 

The table may be used not only for the progression 10, 12, 16, 20, 24, 30, 40, 60, 120, oo, 
but also for aubmultiple progressions. The most important of these, namely 10, 12, 16, 20, 
30, 60, c», is obtained by halving the last seven terms, and is catered for by the auxiliary 
arguments in brackets. Division by 4 yields the progression 6, 6, 7-6, 10, 16, 30, oo, while 
division by 6 yields 3, 4, 4-8, 6, 8,’ 12, 24, oo, from which we can select the first seven or the 
last seven values. 

The missing values 7-6 or 4-8 can be found by ordinary interpolation from values in 
their immediate neighbourhood. If linear interpolation does not suffice we may use 

/(7.6) =i»s{-/( 0) + 9/(7) + 9/(8)-/(9)}, 
which takes third differences into account, and 

/(4-8) = 0-12/(4)-fO-96/(6)-0-08/(6), 

which takes second differences into accotmt. Since /(7’6) and/(4-8) are not required to full 
tabular accuracy these formulae will, as a rule, suffice. 


Example 1. Find the 0'5 % point of the Incomplete Beta Function corresponding to 
Vi = 4 and = 96. 

In the accomp'anyirig table enter row 96, which gives the Lagrangian multipliers. The 
corresponding multiplicands are taken from' the column of the table of 0-6 % points on 
®(96, 4) = -hO- 491 44 xO-005 876 p. 180 and are the entries for = 20, 24, 30, 40, 60, 120 

and 00 , which correspond to the column headings in the 
Lagrangian table. The sign of each product is also given 
at the top of the columns. We have, therefore, the scheme 
shown alongside. The result is two units greater in the 
fifth decimal than the exact value obtained by inverse 
interpolation in Pearson’s tables. 

Example 2. In Fisher’s table of 1 % points of the distribution of z find tho point 
«(I2, 64) = +0'7744 x 0'003 23 corresponding to n^= 12 and n, = 64. 

In Fisher’s table we have the harmonic progression 
10, 12, 16, 20, 30, 60, oo, for which our table provides 
by means of the arguments in brockets. Using the 
Lagrangian multipliers, and the tabular entries corre- 
sponding to the bracketed column headings in row (64), 
wo have the values alongside. 


-0’660 98 x 0'044 647 
-)-0'618 64x0' 162 207 
-0'696 71 x0'318 909 
-)-0-783 70 x 0'668 090 
-^0•884 42x0-669 708 
-1-000 00 x 0-022 324 
= 0-867 94 


-0-7122 x 0-024 40 
-t-0-6496 x 0-082 10 
-0-6864 x 0-167 43 
■1-0-6224 x 0-266 84 
-f 0-4674 X 0-863 88 
-0-3908 x 0-014 23 
= 0-4647 
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TABLE OF PERCENTAGE POINTS OF THE 
f- DISTRIBUTION 

Calculated by CATHERINE M. THOMPSON 
Editobial 

In the calculation of the percentage points of the incomplete A-function 
Ix{p> 7 ) large values of and q described in the preceding paper, use has been 
made of the relation between this function and the incomplete /’-function 
I{u, 3 )*)t which was tabulated by Karl Pearson ( 2 ). Since the incomplete P-func- 
tion is related to the probabihty integral of it was decided that the calculation 
of the percentage points of u already carried out should be extended to form 
complete tables of percentage points of Before describing the method of com- 
putation used in deriving these tables it is desirable to relate them to existing 
tables and to define the relation between the functions. 

In common terminology the probability distribution of having v degrees 
of freedom may be written 

The probability integral of or the chance that this quantity exceeds a 
given value -}^ is then 

P = W)= f(X^)dx^- (2) 

Jx* 

Conversely, for given degrees of freedom v, the integral (2) will be equal to 
a given probabihty level P for one particular lower hmit This lower hmit is 
called the percentage point corresponding to v and P ; it wUl be denoted by ;\^(P). 

These percentage points y^(P) were first tabulated by R. A. Fisher (i) for 
P == 0-99, 0-98, 0-96, 0-90, 0-80, 0-70, 0-60, 0-30, 0-20, 0-10, 0-06, 0-02, O-Oli and 
for v = 1 (1)30. Most entries in the body of Fisher’s table are given to three 
decimal accuracy (i.e. four to five significant figures) but more decimals are given 
for small percentage points which are given to three-figure accuracy. In the table 
which follows ;^(P) is tabulated for P = 0-996, 0-99, 0-976, 0-96, 0-90, 0-76, 0-60, 
0-25, 0-10, 0-06, 0-026, 0-01, 0-006 (which are the levels used for q)) whilst 
the range of the degrees of freedom has been extended up to y = 100. The per- 
centage points are given to six significant figures, although for v > 60 the sixth 
figure may be in error by one or two units. 

f Karl Pearson’s notation was /(«, p). To avoid confusion with the parameter p in Ijpi S') we 
have added the asterisk. 

J In a later edition the level P = 0-001 was added. 

I3-* 
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Table of ^percentage points of the distribution 


Table of percentage points of the distribution 



0-996 

0-990 

0-976 

0-960 

0-900 

0-750 

1 

392704.10-’“ 

157088.10-* 

982069. 10-» 

303214.10-“ 

0-0167908 

0-1015308 

2 

0-0100251 

0-0201007 

0-0606356 

0-102687 

0-210720 

0-576364 

3 

0-0717212 

0-114832 

0-215796 

0-361846 

0-684376 

1-212634 

4 

0-206990 

0-297110 

0-484419 

0-710721 

1-063623 

1-92266 

6 

0-411740 

0-664300 

0-831211 

1-146476 

1-61031 

2-67460 

6 

0-676727 

0-872086 

1-237347 

1-63639 

2-20413 

3-46460 

7 

0-989266 

1-239043 

1-68987 

2-16736 

2-83311 

4-26486 

8 

1-344419 

1-646482 

2-17973 

2-73264 

3-48064 

6-07064 

9 

1-734926 

2-087912 

2-70039 

3-32611 

4-16816 

6-89883 

10 

2-16686 

2-56821 

3-24697 

3-94030 

4-86518 

6-73720 

11 

2-60321 

3-06347 

3-81576 

4-67481 

6-67779 

7-68412 

12 

3-07382 

3-67066 

4-40379 

6-22603 

6-30380 

8-43842 

13 

3-56503 

4-10691 

6-00874 

6-89186 

7-04160 

9-29906 

14 

4-07468 

4-66043 

6-62872 

6-67063 

7-78063 

10-1653 

16 

4-60094 

6-22935 

6-26214 

7-26094 

8-64676 

11-0366 

16 

5-14224 

6-81221 

6-90766 

7-96164 

9-31223 

11-9122 

17 

6-69724 

6-40776 

7-66418 

8-67176 

10-0862 

12-7919 

IS 

6-26481 

7-01491 

8-23076 

9-39046 

10-8649 

13-6763 

10 

6*84398 

7-63273 

8-90666 

10-1170 

11-6609 

14-6620 

20 

7-43386 

8-26040 

9-69083 

10-8608 

12-4426 

16-4618 

21 

8-03366 

8-89720 

10-28293 

11-6913 

13-2396 

16-3444 

22 

8-64272 

9-54249 

10-9823 

12-3380 

14-0416 

17-2396 

23 

9-26042 

10-19567 

11-6886 

13-0906 

14-8479 

18-1373 

24 

9-88623 

10-8664 

12-4011 

13-8484 

16-6687 

19-0372 


10-6197 

11-6240 

13-1197 

14-6114 

16-4734 

19-9393 

26 

11-1603 

12-1981 

13-8439 

16-3791 

17-2919 

20-8434 

27 

11-8076 

12-8786 

14-6733 

16-1613 

18-1138 

21-7494 

28 

12-4613 

13-6648 

15-3079 

16-9279 

18-9392 

22-6672 

29 

13-1211 

14-2666 

16-0471 

17-7083 

19-7677 

23-6666 

30 

13-7867 

14-9635 

16-7908 

18-4926 

20-6992 

24-4776 

40 

20-7066 

22-1643 

24-4331 

26-6093 

29-0606 

33-6603 

60 

27-9907 

29-7067 

32-3674 

34-7642 

37-6886 

42-0421 

60 

36-6346 

37-4848 

40-4817 

43-1879 

46-4689 

62-2938 

70 

43-2762 

46-4418 

48-7676 

61-7393 

66-3200 

61-6983 

80 

51-1720 

63-5400 

67-1632 

60-3916 

64-2778 

71-1446 

90 

59-1963 

61-7641 

66-6466 

69-1260 

73-2912 

80-6247 

100 

67-3276 

70-0648 

74-2219 

77-9296 

82-3681 

90-1332 

Vf 

-2-6768 

-2-3263 

-1-9600 

-1-6449 

-1-2818 

-0-6746 


For 30 < V < 100 intorpolation formulae (6) or (7) of the Introduction may bo used. 
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Table oe peeoentagb points oe the - x ^ distribution [continueA) 


>< 

0-600 

0-260 

0-100 

0-060 

0-026 

0-010 

0-006 

1 

0-454937 

1-32330 

2-70664 

3-84146 

6-02389 

6-63490 

7-87944 

2 

1-38629 

2-77269 

4-60617 

5-99147 

7-37776 

9-21034 

10-5966 

3 

2-36697 

4-10836 

6-26139 

7-81473 

9-34840 

11-3449 

12-8381 

4 

3-35670 

6-38527 

7-77944 

9-48773 

11-1433 

13-2767 

14-8602 

6 

4-36146 

6-62668 

9-23635 

H-0706 

12-8325 

16-0803 

16-7496 

6 

5-84812 

7-84080 

10-6446 

12-6916 

14-4494 

16-8119 

18-6476 

7 

6-34681 

9-03715 

12-0170 

14-0671 

16-0128 

18-4753 

20-2777 

8 

7-34412 

10-2188 

13-3616 

16-5073 

17-5346 

20-0902 

21-9660 

9 

8-34283 

11-3887 

14-6837 

16-9190 

19-0228 

21-6000 

23-6893 

10 

9-34182 

12-6489 

16-9871 

18-3070 

20-4831 

23-2093 

26-1882 

11 

10-3410 

13-7007 

17-2760 

19-6761 

21-9200 

24-7260 

26-7569 

12 

11-3403 

14-8464 

18-5494 

21-0261 

23-3367 

26-2170 

28-2995 

13 

12-3398 

16-9839 

19-8119 

22-3621 

24-7366 

27-6883 

29-8194 

14 

13-3393 

17-1170 

21-0642 

23-6848 

26-1190 

29-1413 

31-3193 

16 

14-3389 

18-2461 

22-3072 

24-9968 

27-4884 

30-6779 

32-8013 

10 

16-3386 

19-3688 

23-5418 

26-2962 

28-8454 

31-9999 

34-2672 

17 

16-3381 

20-4887 

24-7000 

27-6871 

30-1910 

33-4087 

36-7186 

18 

17-3379 

21-0049 

26-9894 

28-8693 

31-6264 

34-8063 

37-1664 

19 

18-3376 

22-7178 

27-2036 

30-1436 

32-8523 

36-1908 

38-5822 

20 

19-3374 

23-8277 

28-4120 

31-4104 

34-1696 

37-6662 

39-9968 

21 

20-3372 

24-9348 

29-8151 

32-6705 

35-4789 

38-9321 

41-4010 

23 

21-3370 

20-0393 

30-8133 

33-9244 

36-7807 

40-2804 

42-7966 

23 

22-S309 

27-1413 

32-0069 

36-1726 

38-0757 

41-6384 

44-1813 

24 

23-3307 

28-2412 

33-1963 

36-4161 

39-3641 

42-9708 

45-6686 

26 

24-3366 

29-3389 

34-3816 

37-6525 

40-6466 

44-3141 

46-9278 

26 

26-3364 

30-4345 

36-6631 

38-8862 

41-6232 

46-6417 

48-2899 

27 

26-3363 

31-6284 

36-7412 

40-1133 

43-1944 

46-9630 

49-6449 

28 

27-3363 

32-6206 

37-9169 

41-3372 

44-46P7 

48-2782 

' 60-9933 

29 

28-3362 

33-7109 

39-0876 

42-6669 

45-7222 

49-6879 

62-3356 

30 

29-3360 

34-7998 

40-2560 

43-7729 

46-9792 

60-8922 

63-6720 

40 

39-3364 

46-6160 

61-8060 

65-7586 

69-3417 

63-6907 

66-7669 

60 

49-3349 

66-3336 

63-1671 

67-5048 

71-4202 

76-1639 

79-4900 

60 

69-3347 

66-0814 

74-3970 

79-0819 

83-2076 

88-3794 

91-9617 

70 

69-3344 

77-6766 

86-5271 

90-6312 

96-0231 

100-426 

104-216 

80 

79-3343 

88-1303 

90-6782 

101-879 

106-629 

112-329 

116-321 . 

90 

89-3342 

98-6499 

107-566 

113-146 

118-136 

124-116 

128-299 

100 

99-3341 

109-141 

118-498 

124-342 

129-661 

136-807 

140-169 

yp 

0-0000 

-f 0-6746 

-t- 1-2816 

+ 1-6449 

+ 1*9600 

. 

+ 2-3263 

I +2-6768 


Eor ^>100 take xS(-P) = — ^ + = 

according to the degree of accuracy required. 
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190 Table of ■percmtage points of the •f‘ distribution 

The relation with the incomplete i -function is giren by 

( 3 ) 

where 1), (4) 

the degrees of freedom, v, of y® being given by 

v = 2p*-t-2. (6) 

The above relations were used for the computation of Xvi^)' Most of the 
entries were obtained from the Tables of the hicoinphte r-Function&), In these 
tables the column headed p* = Iv — L was entered, the iroot u of 

I{u, p*) = 1 - P 

found by inverse interpolation and transformed into the corresponding percen- 
tage point for y® by substitution in equation (4). 

Although for small values of p* and v, the table of l(u, p*) is not interpolable, 
formal inverse interpolation in this range of the table still yields approximate 
values of the percentage points. To make these accurate, auxiliary tables of 
PfX^) were constructedior arguments the neigh bourhood of the approximate 

percentage points, and the exact values of xHP) found by inverse interpolation 
in these auxiliary tables. The latter were constructed from the expansion of 
P,,{x^) given on p. xxxi of Tables for Statisticians and Biometridans, Part 1(3). 
Since the auxiliary tables were required only for smaU. values of x® a few terms in 
the expansions were sufficient to yield i^(x*) to the required accuracy. 

Whilst the existing table of percentage points of x® ( b is confined to the range 
V = 1 (1) 30, the range v = 30 (10) 100 has been added in the table below. This has 
been done because the customary approximation to Xv{P) by the corresponding 
normal deviates is not veyy satisfactory in this range of v. There are, in fact, 
percentage points which differ from the approximate ones in the second significant 
figure. In our table, however, linear interpolation (which is particnlaiiy con- 
venient at interval 10) yields interpolates accurate to about four significant 
figures. If we write v — 104 + m with 3 < 4 < 10 and 0 < m < 10 then we have 

= iW( 10 - «i) XloJc + mxlojc+io)- (0) 

For instance for v = 64 and P = 0-01 we have 

Xi4(0-01) -^{6xio(0-01)-l-4xlo(0-01)} = 81.04. 

If higher accuracy is required we have to use the four point Lagrangian formula, 
viK. 

= •^-lXl0fc-10 + -^0Xl0& + l^l?(Il0fc+10 + .^2Xl0/c+20 (1) 

where the Lagrangian coefficients L_i, L^, and are tabulated below. 
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m 

•b-i 



h 



_ 

-i- 

+ 

_ 


0 

0-0000 

1-0000 

0-0000 

0-0(jO0 

10 

1 

0-0286 

0-9405 

0-1046 

0-01G5 

9 

2 

0-0480 

0-8640 

0-2160 

0-0.320 

8 

3 

0-0696 

0-773.5 

0-.3,316 

0-045.5 

7 

4 

0-0640 

0-6720 

0-4480 

0-05ti0 

6 

6 

0-0625 

0-S626 

0-602.5 

0-0625 

S 


- 

+ 

X 

- 



h 

h 

h 


m 


Returning to the above example we obtain. 

Xl4(0-01) = - 0'0640;\;|o(0'0] ) + 0-6720xlo(0-01) + 0‘448();v|o{0-01) - 0-0560 a'|o(0-01) 
= 81-069 

which is the exact interpolate to five figures. 

For V > 100 we can make use of Fisher’s approximation to l],{x^) by the normal 
probability integral and calculate Xvi^) ^roin. the formula, 


where the normal deviates yp corresponding to the thirteen percentage levels 
are given in the last line of the main table. For v - 100, this approximation has 
an accuracy of about 1 % in the worst cases, but as v increases it becomes more 
accurate. 

A more accurate approximation has been given by Wilson and Hilfertyti); 
this assumes that is normally distributed about 1 -2/(9r) with standard 

deviation ^{2j9p). That is to say, the probability levels may be calculated from 

Comparative numerical values showing the relative accuracy of these two 
formulae are given in the Note by Mrs M, Merrington on pp. 200-2 below, 
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MISCELLANEA 

(1) Theory of Probability. By Habold jB]?KRBys. Oxford University Press. 1939. 

7 + 380 pp. 21a. net. 

In the history of the application of probability theory to the problem of drawing inferences 
from observations, no set of ideas has played a more controversial role than that asso- 
ciated with inverse probability. Variotia leaders of modem thought in statistical inference 
have pointed out the logical difficulties inherent in the application of inverse probability. 
E. A. Fisher, through hia methods of maximum hkeliliood and of fiducial limits, has intro- 
duced principles of statistical inference which make the introduction of the notion of inverse 
probability irrelevant. These principles have been extended and refined by J. Neyman, 
E. S. Pearson, A. Wald and others, until now we have available in statistical literature a 
self-consistent discipline of statistical inference which is independent of inverse probability. 

In the present book the author proposes a system of statistical inference based on the 
principles of inverse probability, applying it to the same problems which have been treated 
by Fisher, Neyman, Pearson and othens without using inverse probability. The attitude 
which the author talces towards probability theory is somewhat similar to that taken by 
J.M. Keynes. Probability is regarded as a subjective phenomenon. The essential idea is that 
probability is a matter of comparing ‘reasonable degrees of belief’ in propositions. In 
Chapter i the author goes through a considerable amount of psychological and philosophical 
discussion attempting to justify this approach. This discussion is finally formalized by a set 
of six axioms. The primitive or undefined notion is that of the relation ‘given p, q is more 
probable than r where p, q, and r are propositions. The symbol used for denoting the prob- 
ability of q given p is P(g j p). The six axioms which are used are as follows; 

(1) Given p, q is either more or less probable than r, or both oi'o equally probable; and no 
two of these alternatives can be true. 

(2) If J), g, r, a are four propositions and given p, g is more probable than r and r is more 
probable than a, then given p, q is more probable than ®. 

( 3) All propositions deduoible from a proposition p have the same probability on data p ; 
and all propositions inconsistent with p have the same probability on data p, 

(4) If, givenp, g and q' cannot both bo true, and if, given p, r and r' cannot both be true, 
and if, given p, g and r are equally probable and q' and r' are equally probable, then given 
p, ‘ g or g' ’ and ‘ r or r' ’ are equally probable. 

(6) The set of possible probabilities on given data, ordered in terms of the relation ‘more 
probable’, is not of higher ordinal type than the continuum including the end-points. 

(6) If pg entails r, then P(gr j p) = P(q\ p). 

In axiom 6, the expression ‘a entails b' is defined as meaning ‘a is deducible from &’, or 
‘a is identical with &’, or ‘ a is identical with some proposition asserted in 6’, The expression 
‘ at ' is taken as the logical product, that is ‘both a and fe ’. 

The introduction of numbers for expressing probabilities is made through three ‘con- 
ventions ’. The first convention associates the larger of two numbers with the more probable 
of , two propositions, the second one states that if given p, g and g' are mutually exclusive, 
thenP{g|p)-(-P(g^ Ip) = P(g or g'|p), and the thirdstatesthatifpentailsg, then P(g|p) = 1. 

In order to be thoroughly rigorous in his axiomatic approach, presumably the author 
should have postulated the existence of an aggregate of propositions on which to operate. 
The possibility of an infinite number of propositions should, of course, not be excluded, as 
will be seen when the author applies his theory to problems involving continuous random 
variables. In the case of an infinite ninnber of propositions, it appears that the assumption 
should be made that axiom 4 would hold in case of two infinite sets of mutually exclusive 
alternatives. A similar assumption would have to be made for the second convention. The 
proponent of the measure theory approach to probability has essentially the same problem 
to deal with, but he handles it by assuming the existence of set functions which are completely 
additive over his postulated field of sets, 
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Proceeding from his six axioms and three conventions the author completes his chapter 
by establishing twelve theorems which are used as the basis for the work in the subsequent 
chapters. The question of an infinite number of alternatives is not covered by these theorems. 

Chapter n, on ‘Direct Probabilities’, is devoted to derivations and discussions of the 
binomial, normal, Poisson, Pearson, multinomial. Chi-square, t, a and other frequency laws 
and their properties. The characteristic function and illustrations of its use in the deter- 
mination of probability laws are presented. 

In building up his system of statistical inference Jeffreys proceeds in Chapters ni-viii 
by applying the principle of inverse probability to the results of Chapter ii. Thus if /S is a 
set of observations subject to a given discrete distribution law P{S \ OH) derived under a 
distribution hypothesis H, where d is a parameter to be estimated, an a priori probability 
function g{d) dd for 6 is introduced. The posterior probability law of 0 giyen S, say P{0 [ SH), 
is given by 


where S denotes summation with respect to all possible configurations of values of 8, and 
the integral is taken with respect to 6. A similar analysis results when S is subject to a con- 
tinuous distribution law. The function P(6 1 SH.) is then taken as a basis for estimating d. 
For two values of 0, say d^ and da, 6i is called more probable than 6^ if P{0i \ SH) > P(dj | SH ) . 
Needless to say, this comparison of posterior probabilities depends on the choice of g{6). 
Now, from the point of view of applying this discipline, g{d) would rarely, if ever, be known, 
and the controversy over inverse probability centres around the problem of choosing g{6). 
The author adopts two rules for selecting g(d)x (1) If the parameter may have any value in 
a finite range, or from — oo to -I- oo its pyfior probability should be taken as uniformly dis- 
tributed. (2) If the parameter may conceivably have any value from 0 to -f oo, the prior 
probability of its logarithm should be taken as uniformly distributed. The adoption of these 
two rules appears to the reviewer to be extremely vulnerable. First of all, what does it mean, 
in general, for a parameter to be uniformly distributed on the interval — oo to -f- oo ? This 
question appears to be particularly in order since the author is performing the formal calculus 
of probabilities in exactly the same manner in which measure proponents calculate prob- 
abilities. They continually use the property that a finite total probability (taken arbitrarily 
as unity) is associated with each probability function. Presumably, meaning could be 
injected by carrying out the work for a finite interval — K to K and then taking the limit of 
the results or answers as iC ->■ oo. Similarly, it may be asked what it means to have the loga- 
rithm of a parameter uniformly distributed on a semi -infinite interval. Owing to the nature 
of the particular problems to which Jeffreys applies these rules, it happens that P{S | dH) 
is such that formal difficulties of convergence do not arise in obtaining P(d \ SH), For the 
finite interval, why should one choose the parameter to be uniformly distributed rather than 
the square or some other function of the parameter, or for the setoi-inflnite interval (0, oo) 
why should one choose the logarithm of the parameter rather than some other function to be 
uniformly distributed? It is easy to show that the assumption of uniform distribution is, in 
general, inconsistent with that of uniform distribution of any single-valued function of the 
parameter. 

Chapter iv, entitled ‘Approximate Methods and Simplifications’, contains discussions, 
some of them rather heuristic, of the problem of estimation involved in such topics as 
maximum likelihood, least squares, errors due to grouping, rank correlation, contingency, 
artificial randomization, etc. The inverse probability approach is, of course, maintained 
throughout. 

The problem of significance tests is treated in Chapters v and vi. J effreys’ attitude toward 
significance tests follows along the lines of his concept of probability and consists in com- 
paring posterior probabilities. More speoifloally, suppose d is a parameter and it is desired to 
test the h 3 ?pothesis that d = 0 on the basis of a given set of observations and an hypothesis H 
regarding the distribution law of S for given d. Let g denote the hypothesis that d = 0, and 
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g denote the hypothesis that 6 has some other value. Jeffreys’ criterion for makmg the 
mgnificance test is the ratio K of the two posterior probabilities P( 2 ' | SIl) and Pig j SH). 
The value of K itself, and not the probability integral of K under the null hyijothesis q, is 
proposed as the criterion. Expressions for K are found for such problems as contingency, 
comparison of means and variances in samples, consistency of two Poisson parameters, 
correlation — problems which have already been treated by Fisher, Neyinan, Pearson and 
others by other approaches free from inverse probability. In the treatment of all of these 
problems, the author arrives at four principal forms of K, which he proceeds to tabulate in an 
appendix for various values of sample size andfor five grades of significance corresponding to 
K = 1, 10“1, lO”’-, 10-S, 10~®. The last two chapters, i.o. vii and viii, are entitled ‘Frequency 
Definitions and Direct Methods’, and ‘General Questions’, respectively. These chapters 
are primarily philosophical excursions undertaken in an attempt to show that no existing 
definition of probability avoids the notion of * degrees of reasonable belief ’ , and to j ustify his 
own approach as well as his attitude toward inverse probability. The discussion is almost 
entirely informal and non-mathematical and as .such it must be regarded in the category of 
personal opinion. 

The book lacks strict mathematical rigour in various places, but from the point of view 
of general flow of discussion it is interestingly written. It contains many keenly chosen 
quotations and side remarks charged with a delightfully subtle humour which has cha- 
racterized the author in other books. 

From a scientific point of view it is doubtful that there will be many scholars thoroughly 
familiar with the system of statistical inference initiated by R. A. Fisher and extended by 
J. Neyman, E. S. Pearson, A. Wald and others who will abandon this system in favour of 
the one proposed by Jeffreys in which inver.se probability plays the central role. 

FEINOETON S. S. WILKS 

UNIVERSITY 


(ii) A Bibliography of Human Morphology, 1914-1939. By Wilton M. Kbooman. 

United States of America: University of Chicago Press; Great Britain and Ireland: 

Cambridge University Press. 1941. Price ISs. 

The title of this volume may mislead to some extent. The 11,000 odd references in it were 
collected to aid physical anthropologists and the work will be of greater value to them than 
to other research workers, such as anatomists and geneticists, who are concerned with hmnan 
morphology. The non-German literature is said to be covered more thoroughly than in the 
second edition of Rudolf Martin’s Lehrbuoh, and there is no other comprehensive bibliography 
of the subject for the period since 1928. It is not claimed that the list is exhaustive, and the 
most stringent selection appears to have been made in the -section on blood groups for which 
fuller bibliographies are available. 

G.M.M. 


(ill) A Property of the Distribution of Excremes 

By H. E. DANIELS 
Wool Industries Besearch Association 


If the chance of an observation being less than at is P, then P” is the chance that the greatest 
of a random sample of n is less than at. The constants of the distribution of the greatest of a 

dx 


sample in the important ease when P 




-i©* . 


V(2u) 


have been calculated by Tippett (1925) 


for values of n up to 1000, and in a paper in which all possible limiting forms of P" are dis- 
cussed, Fisher & Tippett (1928) give limiting formulae from which approximate values of 
the constants axe calculated for largo samples. 
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In the present note attention is drawn to a curious approximate relation eonriecting the 
mean M anti standard deviation <r of this distribution which holds with high accuracy for 
all values of n. It was arrived at empirically and appears to har''e no obvious mathomatical 
derivation. The formula is ^ 2 


Tho values of 2 cot ^rr<r, calculated from Tippett’s figures, are compared in Table 1 with 
Tippett’s values of Jtf . The greatest discrepancy, S — M — Z cot Jttct, over the rang© of n up to 
1000 occurs at n = 10 where it is no more than about 1| %. For n greater than 1000 tho 
penultimate limiting values given in Table A of Fisher & Tippett’s paper are used in Table 2 
and the discrepancies are again found to be small. Tho nio.st serious is of tho order of 3 % 
when n is 7228, but their Table B suggests that the penultimate limiting values of M and cr 
are probably underestimated and as 2 cot is fairly sensitive to changes of cr in the region 
of O' = 0-3, the real discrepancy may perhaps be smaller. 


Table 1 


71 

— 

M 

(T 

2 cot \n(T 

iS:=^— 2cot^ir<r 

1 



0-0000 


2 



0-6617 


6 

1-1630 


1-1448 


10 



1-6176 

-0-0212 

20 


0-6261 

1-8483 

-0-0192 

60 

2-3193 

0-4646 

2-3086 

-0-0107 




2-6012 

-0-0064 




2-7448 


600 

3-0367 


3-0407 



3*2414 

0-3614 

3-2476 



Table 2. Penultimate approUdmate values 


71 

M 

<r 

2 cot Itto- 

d = iH — 2 cot |irtr 

7228 

3-7697 

0-3039 

3-8661 

0-0964 


4-7719 

0-2499 

4-8311 


264x10® 

6-9262 

0-1787 

6-9369 

0-0107 


Fisher and Tippett show that in the ultimate limiting form of the distribution tho mode 
m, mean M and standard deviation cr are related by the formulae 

M = m+yo, cr” = 

where c = 1) and y = 0'677216 is Euler’s constant. Consequently 






1 ^ ^ 


1-282 


as cr becomes small with increasing n. On tho other hand, our approximate relation gives 

4 1-274 


M = 2 cot ■jTTcr.- 


TKT 


The error at m = oo is thiis seen to be less than 1 %, 


REFERENCES 
Tippett, L. H. C. (1926). Biometrika, 17, 364. 

Fishbb, R. a. & Tippett, L. H. O., (1928). Proa. Oamh. Phil. Soc. 24, 180. 
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(iv) Proof of Relations connected with the Tetrachoric Series and 
its Generalization 

By M. G. KENDALL 

If a bivariate normal distribution F with variates and and correlation p is doubly 
dichotomized at = h, aij = k, and 


d 


(*co /*ao 

-LL"’ 


it is Itnown that 


d = S p''Tr(h)rf{k), 
r=o 

where r, is the rth tetrachoric function, defined by 


r^ix) = ■ 


(r!)i 


( 1 ) 


( 2 ) 


and/(a:) being the function 


and H,(a;) the rth Hermite polynomial defined by 

f(x) = (-Dyf(x). (3) 


V(2v) 


The purpose of this note is to present a simple proof of this result,* to prove that the 
series of equation (1) is convergent for | p 1 ^ 1 and to generalize the series to the case of the 
multivariate normal distribution.f 
The characteristic function of 

1*00 ^00 

is, by definition, ^(^>*2) = I exp{itiXi + it^Xi}dF, 

J —00 J — 00 

and is easily seen by direct integration to be equal to 

exp { - Kt? + ipkh + *2)}- 


We have then, for all finite q, tj. 


(4) 




h) = exp ( - exp ( - S ( -p) 

r^O »• 


/•cO ^CO J |*'50 ^00 1*00 

Now d=| 1 ^^=7— — , dxA dxj I I <l>{kA^e:xp{ — ikXi — ikx^dtxdti. 

JhJh 

Substituting for^(h>t2) we have, for the coefficient of f ~pYjr\ in this expression, 

J ^00 1*00 ^00 ^co 

^Jh k J J ^ ~ 4^2) { ~ ~ *^ 2 '’^2} (Ih 

and this is the product of two integrals, the first of which is 


s/m: 


exp ( — it?) exp { — iti ajj} dk 


(5) 


( 6 ) 


and the second of which is a similar expression in Kj and t j. 

* The expansion (1) appears tp have been given for the first time by G. Mehler, ‘Reihew- 
entwicklung naoh Laplaceschen Funotionen hoherer Ordnung’, J, reine ctngeiv. Math. 66, 161. 
f See Note at end of paper. 
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Now, since i ex:p{ — \ff‘)V'er<-**dt-. 


d(~ixY 


exp( — Ji®) 




(6) is equal to 


J oe 


(-iy dxB,(x)f(x) = 


and thus (6) is equal to ( - lYH^_^{h)f(h)Hr_i(k)f(k), 


and hence 


Y { 


= Sp%(h)r,{k), 

the tetraohorio series. 

Now for the convergence of the series, consider 


From (6) 


I rA'h) T,(h) I = (r !)-i I H,^Yih)m 

I ■H’r-i(A) /(^O I ^ ^ [ J e’fP ( - ¥^) f-'e-'** dt 

1 r“ 

t exp( — 

J 0 

r-2 

S'* 


^r-^rn- 


|Tr(7t)T,(A!)|<- 


2r-ae-(r-«2n-^L^^ 
ir^{2n) e~'r’'+i 


^(27r)r*’ 

Thus the tetrachoric series converges for | p | < 1, though possibly slowly near \p \ — 1, 
Now consider the general multivariate normal distribution 

- — ^exp- 


{iirfSY 


B= 1 pi8 ... Pin 

pi2 1 ... Pan 


1 Pm Pan ••• 

and is the minor of the jth row and Mh colxnnn. 

The characteristic function is 


/ OO ^00 

... I dF exp (Sitj Xj) dx^... dx„. 

— CO J —CO 
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To ovaUiato this integral make tho tranaforraation 

k 

and choose the a’s ao as to reduce the second deg»-eo terms in g in the exponent to the canonical 
form The remaining terms in t and g will obviously be linear in each. We may then 
make at'urther transformation g' = g— (linear function of the t’a) and the remaining torins, 
apart from i^g'®, will bo a quadratic in the t’s, and equal, say, to 

Tlio integration, abolishes tho toims in |' and wo find that tho characteristic function ia 

proportional in „ 

©xp {hy, ^ + ZSijktjtk]- 

Putting all hut two of tho t’s ssero we see by comparison with (4) that the terms in tj, ij, 
is - ‘witl thus tho characteristic function ia 


exp { — ^(TiJ + 


(9) 


The generaliKed tctrachorie expansion can bo obtained by an expansion of (9) in terms 
of the p’s and tho application of tire foregoing procedure. For instance, with three variates 
we ha%;o 

(-!)’■ 

oxpZ{~p]iitjtji) = 27 — — {pnhh'^P%3^sh+Pn^ihy 

Tl 


and on integration 




d.F 

hj 




which will also be found to be convergent. 


[jVo/r. Since writing tins paper, under the impression that the resulte were now, I am 
indebted to Dr A. C. Aitken for pointing out that similar results have been given by him in 
lectures for a number of years. Among published work, reference may be marie to: 

(o) P. 176 of Aitken & Tumhuirs Theory of Canonical Matrices (Blackie, 1931), where 
a mote direct method of deriving tho oharaoteriatic fmiotion of the multivariate normal 
distribution has been given ; 

(6) A paper on ‘Fourfold sampling witli and without replooernent’ by A. C. Aitken & 
H.T. Gonin (1936, Proc. Hoy. Soc. Edinb. 55, 114), where tho tetrachoric expansions are 
discussed and new series associated with the correlated binomial and tho coiTelated hyper-,, 
geometric distributions are derived. 

As the work of tho Edinburgh school on this subject may not, however, be generally 
familiar, tho Editor ]»as suggested that this short paper should be published, together with 
tlie foregoing references, in order to bring the recent developments before a wider statistical 
audience, m.g.k.] 
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(v) The CumBlants of the Distribution of the Square of a Variate 
By J. B. S. HALDANE, F.R.S. 


The foUowing problem has arisen in several biomotrio investigations. The cnmulants of the 
distribution of x are known, and it is desired to find the cumulants of the distribution of 
As this problem is likely to arise in future, it.seems desirable to give the appropriate trans- 
formations for the first few cnmulants. 


Let Kj, /Cj, K 3 , . . . be the cumulants of x. 

Let X, /<a, /tj, ... be the moments of about zero. 

Let /tg, |l ^, ... be the moments of w* about its mean. 

Let k'i, ATg, A's, ... be the cumulants of a:*. 

Then /i' is the 2»-th moment of x. Those have been given in terms of the cumulants up to 
the iOth, i.e. /tj, in the general case by Kendall (1940), and up to the 12th, i.e. /ij, by Haldane 
(1938) when = 0. We consider the general case first. We have such expressions as 

jUg = kJ + 6 ki /fg + 4/Ci Kj + ZkI + K^. 


From these we calculate the moments fi„ and hence the cumulants. The results are: 


/fj = /cJ-t-ATg, 

/Cg =5 4/CiAfg-4-2(2A^j_/rg-|-/Cg)-l-/r4, 

K 3 = SfcIlATi ATj + 3ac 1) + 4(3AjAr| -I- 12/Ci/fj;f3-l- 2i4) + 2(3/fiK, + O/Cj/rg + 5kI) -b Kg, 

Kg = 16Kf{AfiA:4+ 12 /<'iA:3a: 8-(- 12Kg) 

+ 16(2a:Ja: 5-(- ISKjKgKj-l- 12 k?/c|+ 3^) 

+ 8(3«J ATo + + SSKiKgKg + IS^fg/fi-i- SOaTjKs) 

+ S{K^ K,j + SA'j Kg -b 7/Cg /C5 -b 4A4) -b ATg. 


( 1 ) 


After tliis the expressions become very heavy. When Kg = 0, i.e. » has its mean zero, most 
of the terms vanish, and wo have 


k'i ~ Kg, 

Kg = 2Kg -b Kg, 

Kg ~ SKgp 2(6KgK4 -b 8K3) -b 0, 

K4 = 48K^ -b 48Kg(3KgK4 -b 6 k|) -b 8(3KgK, + TKgKj + ikl) b Kg, 

K'g =s 884 Arl + 960 Ki(KaK 4 + SK?) + 80(16K2Kj+28K8KjKB + 6KlK,-b26K|K4) 

“b 2(20Ar3 Kg -b fiOKg Kg b lOOKgKg b 83Kg) bKjg, 

Kg = 3840 k| b 9600K|(3KaK4 b 10 k|) b 4800(2K|Kgb 14A^KjK5b 8 a^K 4+ 26KgK|K4b 3 kJ) 
b 40(30 k1k 8 b ISOKg Kg Kg b SOOKj KgKg b 226 k|k 0 b 189KgK§ b Ql^KgKgKs b 132 k|) 
b 4(lSKjKig b 66KgK, b 120K4Ka b 198KgK, b U3 a:|) b Kij. 


(2) 
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Finally, if * be symmetrically distributed, so that all its odd oumulants vanish, 
4 - Kj, 
atJ K 

4 = 84+12/Cg/Ct + x„ 

4 = 48K’5+l44Ac|K4 + 8(3*rs*;»+4(4) + *;a. 

4 = 384/(:| + 1920K§/C4 + 180^2(3/fj/f8 + 8Ar5) + 40(AraAr8 + 6/CiK«)+Ati„, 

4 - 384:04 + 288004^, + 9600x^(/(2Ktt + 4:4} + 2iO(BKiKg + 60/(ifC4Ke + 224) 
+ 4:(lBx^Ki^+ 120x,Ki+ llS/rli+ATia. 


( 3 ) 


I have bracketed together terms -which are products of the same number of x/a. If x is 
a linefar function of observed numbers in a sample of n, every Ar„ is proportional to x, so the 
terms in braoketa,will all be multiples of the same power of n. 


REFERENCES 

Haldane, J. B. S. (1938). ‘The first six moments of for an n.-fold table with n degrees 
of freedom when some expectations are small.’ Biometrika, 29, 389-91. 

Kendall, M. G. (1940). ‘The derivation of multivariate sampling formulae from uni- 
variate formulae by symbolic operation.’ Ann. Bugen., Land,, 10, 392-402. 


(vl) Numerical approximations to the percentage points of 
the distribution 

By MAXINE HERRINGTON 

The use of two approximate formulae has been suggested for oaloulating a percentage 
point, ^{P)i of the x‘ distribution, corresponding to v degrees of freedom and a probability 
level P.* Both formulae involve the use of the standardized normal deviate, y^, corre- 
sponding to the value of P chosen. 

(1) R. A. Fiafier’s {102B) formula : 

Xl{P) = i{yF+4i^v~l)}\ (a) 

which assumes that i® normally distributed about with unit standard 

deviation. 

(2) E. B. Wilson. <5a M.-M, Hilferly's {1031) formula: 

= (b) 

which assumes that {xV4^ is normally distributed about 1 — 2/(9r) with a standard devia- 
tion of y'(2/9v). 

Professor Pearson has suggested that I should prepare a comparative table showing 
for certain v and P the numerical values of ^{P)x 

(а) calculated from Fisher’s formula, 

(б) calculated from Wilson & Hilferty’s formula, 

(o) the correct values taken from Miss Thompson’s table (pp. 188-9 above). 

"■ The notation used is that adopted in the paper on pp. 187-91 above. 
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MEDICAL STATISTICS EEOM GRAUNT TO EABR 

(Continued*) 

BY MAJOR GREENWOOD 

III. THE STATISTICAL WORK OE GRAUNT 

John Geahnt’s contribution to our subject has always been regarded as one 
of the great classics of science. A few have indeed doubted whether so great 
a work could have been achieved by one whose material success was so modest 
and have sought to transfer the glory to Graunt’s highly successful friend Petty. 
This dispute I relegate to an appendix. I assume that Graunt’s published book 
is substantially his own original work. 

The history of the material Graunt used has been written more than once 
and I have nothing to add to Prof. Hull’s story. Graunt had, for a period of more 
than 00 years, arithmetical statements of the numbers of males and females 
christened and buried and of the causes of death (not distinguished by sex) 
under some sixty headings. He had no information as to the ages at death. He 
had no information as to the number or ages of the living population. 

The first act of a scientific statistician is to assess the trustworthiness of his 
data, to criticize his sources. This' tedious preliminary to the doing of sums was 
not much to Petty’s taste. Petty, as we have seen, often used different data to 
reach some conclusion, but hardly ever discusses the reliabilities of the several 
data. Other Fellows of our College since Petty’s day have made the same 
mistake. The terrible ‘howler’ committed by Dr William Heberden the younger, 
and detected, not without satisfaction, by Charles Creighton is classical.f But 
that was not a unique instance. Indeed, even trained statisticians sometimes 
confuse names with things. More than one rate of mortality has risen (or fallen) 
only on paper. Graunt made no such mistakes. 

Graunt’s general argument is that many causes of death are ‘but matters of 
sense ’, for instance, whether a child were abortive or stillborn, and that in many 
cases the searchers are ‘ able to report the opinion of the physician, who was 
with the patient as they receive the same from the friends of the defunct’. But 
sometimes the searchers will, be wrong and often enough the error will not matter. 

As for consumptions, if the searchers do but truly report (as they may) whether the 
dead corpse were very lean and worn away, it matters not to many of our purposes whether 
the diseases were exactly the same, as physicians define it in their hooks, Moreover, in 
ease a man of seventy -five years old died of a cough (of which had he been free, he might 

* The earlier sections were printed in £iometrik(t, 32, 101-27. 

t Creighton, History of Epidemics in Britain, 2, 747-8. Heberden supposed (erroneously) that 
‘Griping of the Guts’ of the Bills was Dysentery and had decreased. It was Infantile Diarrhoea 
and had simply been transferred to the rubric ‘Convulsions’. 

Biomet rika xxxn 


14 




204 


Medical statistics from Or aunt to Farr 

have possibly lived to ninety) I esteem it little error (as to many of our purposes) if this 
person be in the table of casualties, reckoned among the aged, and not placed under the 
title of coughs (348).* 

No doubt this brutal common sense might set on edge the teeth of some 
Fellows of the College of Physicians even in the seventeenth century, but it was 
one of the qualities which made Graunt a pioneer. Making the best the enemy 
of the good is a sure way to hinder any statistical progress. The scientific purist, 
who will wait for medical statistics until they are nosologically exact, is no wiser 
than Horace’s rustic waiting for the river to flow away. 

Graunt, however, did not accept statements which he had the means of 
testing. Finding in a series of years that of more than a quarter of a million 
deaths only 392 were assigned to the Pox, he did not infer that Syphilis had 
been over-rated as a cause of death. 

Forasmuch as by the ordinary discourse of the world it seems a great part of men have, 
at one time or other, had some species of this disease, I wondering why so few died of it, 
especially because I could not take that to be so harmless, whereof so many complained 
very fiercely j upon enquiry, I found that those who died of it out of the hospitals (especially 
that of Kingsland, and the Look in Southwark)' were returned of ulcers and sores. And in 
brief, I foimd, that all mentioned to die of the French Pox were returned by the clerics of 
St Giles’ and St Martin’s in the Fields only, in which places I imderstood that most of the 
vilest and most miserable houses of uncleanness were; from whence I concluded, that only 
hated persons, and such, whose very noses were eaten off were reported by the searchers 
to have died of this too frequent malady (356). 

In principle, the argument is still valid. 

His next example of criticism is the case of Rickets, which first appeared in 
the Bills of Mortality in 1634 and then -with 14 deaths only, but by 1659 had 
risen to 441. Was Rickets a ‘new disease’ or did an old disease receive, in the 
Bills, a new name? 

To clear this difficulty out of the bills (for I dare venture on. no deeper arguments) 
I enquired what other casualty before the year 1634, named in the Bills, was most like the 
rickets; and I fotmd, not only by pretenders to know it, but also from other Bills, that 
livergi'own was the nearest. For in some years I find livergrown, spleen, and rickets, put 
all together, by reason (as I conceive) of their likeness to each other. Hereupon I added 
the livergrowns of the year 1634, viz. 77, to the rickets of the same year, viz. 14, making 
in all 91 ; which total, as also the number 77 itself, I compared with the livergrowns of the 
precedent year 1636, viz. 82. All which showed me, that the rickets was a new disease over 
and above. Now, this being but a faint argument, I looked both forwards and backwards, 
and found that in the year 1629, when no rickets appeared there were but 94 livergrowns; 
and in the year 1636 there were 99 livergrowns, although there were also 60 of the rickets : 
only this is not to be denied, that when the rickets grew very numerous (as in the year 1660, 
viz. 621) then there appeared not above 16 of livergrown. lb. the year 1669 were 441 rickets 
and 8 livergrown; in the year 1668 were 476 rickets and 51 livergrown. Now though it be 
granted that these diseases were confounded in the judgment of the nurses, yet it is most 
certain that the livergrown did never but once, -viz. anno 1630, exceed 100; whereas anno 

* Numbers in brackets are page references to Prof. Hull’s edition of Th& Economic, Writings of 
Sir W illiam Petty together with the Observations upon the Bills of MorlaUty more probably by Oaptain 
John Graunt, Cambridge, 1899. 
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1660, livergrown and rickets were 636. It is also to be observed, that the rickets were never 
more numerona than now, and that they are still increasing; for anno 1649, there were but 
190, next year 260, next after that 329 and so forwards, with some little starting bEtckwards 
in some years, until the year 1660, which produced the greatest of all (367-8). 

This is an excellent statistical argument, and, incidentally, evidence that 
Graunt "wrote his own hook, for a physician would probably have suggested that 
the professional interest excited by the classical treatise of Glisson (assisted by 
Regimonter) which was published in 1650 might easily have increased the 
popularity of the diagnosis. Petty, who, with Ghsson, was a founder of the 
Royal Society, would hardly have ignored his colleague’s work. 

I cannot resist the desire to mention others which, while of little statistical ■ 
importance, have a medical attraction. Graunt noticed that Stopping of the 
Stomach first appeared in the Bills of 1636, increased from 6 to 29 by 1647, by 
1655 it reached 145, in 1667, 277 and 1660, 314, First he conjectured that 
Stopping of the Stomach might be the Green Sickness, ‘forasmuch as I find few 
or none to have been returned upon that account, although many be visibly 
stained with it’. He thought that possibly Green Sickness might not appear in' 
the Bills ‘for since the world believes that marriage cures it, it may seem indeed 
a shame, that any maid should die uncured, when there are more males than 
females, that is, an overplus of husbands to all that can be wives’. Then he 
wondered whether Stopping of the Stomach might not be Mother, ‘ forasmuch 
I have heard of many troubled with Mother Fits (as they call them) although 
few returned to have died of them’. But he was diverted by guessing ‘rather 
the Rising of the Lights might be it ’. He remembered that some women troubled 
with the Mother fits did complain of a choking in their throats. ‘Now, as I 
understand, it is more conceivable that the Lights or Lungs (which I have heard 
called the bellows of the body) not blowing, that is, neither venting out, nor 
taking in breath, might rather cause such a choking, than that the Mother should 
rise up thither, and do it. For methinks, when a woman is with child, there is 
a greater rising, and yet no such fits at all’ (359). He notes that Rising of the 
Lights increased in the Bills from 44 in 1629 to 249 in 1660. 

Finally, he suggests a correlation between Stopping of the Stomach, Rising 
of the Lights in adults and the Livergrown, Spleen and Rickets of children. 

‘ And that what is the Rickets in children, may be the other in more grown 
bodies ; for surely children which recover of the Rickets, may retain somewhat 
to cause what I have imagined : but of this let the learned physicians consider, 
as I presume they have’ (359). 

It might be suggested that one item under Stopping of the Stomach could 
be surgical, viz. strangulated hernia. Rupture was a heading in the Bibs, but 
the numbers are small and show no regular increase with the increase of popu- 
lation. Graunt’s attraction to what used to be called hysterical stigmata is 
interesting. One wonders how far these passages reflect conversations with 
Petty. It is clear that Graunt had no belief in the peripatetic uterus; Petty 
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would have had none. The best medical opinion of the age ia, of course, that of 
Sydenham. Sydenham (whose pathology was traditional) had a pneumatist 
aetiology of Hysteria, the origin was an ataxia of the animal spirit (which was 
the pneuma zotilcon of ancient tradition). He not only believed that Hysteria 
might be a serious or even mortal complication of organic disease — as we do 
still — but that the ataxic spirits might themselves produce humoral corruption 
and lead to chlorosis or ovarian dropsy (Dissertatio epistolaris, 92). So there is 
nothing repugnant to the best professional opinion of the age in admitting 
Hysteria to the list of causes of death. Nor is there any gross absurdity in the 
suggested correlation of increasing Rickets and increasing Hysteria, from the 
point of view of a layman. But that surmise does not imply any professional 
hint, it rather suggests a belief in a merely physical factor, the pressme of an 
enlarged organ. That passage would not have been written by a physician. 

These are sufficient instances of Graunt’s criticism of sources — ^the temptation 
to go on quoting examples must be resisted. I pass to his great achievement, 
the estimation of rates of mortality at ages when the numbers and ages of the 
living were not recorded. For such an estimation to be correct, we all know that 
the population must be stationary, viz. non-increasing, not subject to migration 
and having constant rates of mortality in the several age groups. 

It is a nice point whether Graunt or Petty appreciated the importance of 
these considerations. Graunt was certainly ahve to the fact that the population 
of London was growing and that the growth was due to immigration from the 
country. The arithmetical position was this. In the earlier years of his series 
burials and christenings were about equal in numbers, in 1605 there were 6948 
burials and 6604 christenings; in 1626, 7860 burials and 7682 christenings, in 
1636, 10,661 burials and 10,034 christenings. Later the burials continued to 
increase, but the christenings either decreased or failed to increase in the saine 
proportion. This Graunt attributed to neglect of christening owing to religious 
dissidence and gave excellent reasons fpr his view. It is clear then that there 
were two factors of increase, immigration and increasing numbers of births. 
Most of Graunt’s deductions are based upon an analysis of the deaths by causes 
for twenty years, 1629-36 and 1647-68, which he selected as years comparatively 
unaffected by plague (of his total of 229,260 deaths only 16,000 were from 
plague). 

If we treat this total as a denominator (or one-twentieth of it) it will, from 
the point of view of calculating mortality ratios, be affected by two errors. The 
deaths of immigrants will make it too large and the increasing births will make 
it too small. Can it be Graunt held that the errors balanced so that, arithmetically 
speaking, one might behave as if one were dealing with a stationary population? 
An alternative explanation is that Grarmt did not reab'ze the limitations of the 
method, 

A third possibility is that, although he knew the fallacy, he believed that 



Majob Greenwood 207 

the incorrect method gave an approximation to truth sufficient for his purposes. 
This is the solution I should he inclined to adopt were I forced to choose. 

As I have pointed out above, there is at least a suggestion that Petty did 
have some glimmering of the conditions to be fulfilled if a summation of deaths 
is to give a correct view of rates of mortality. I do not believe that Graunt was 
less informed on any point of vital statistics than Petty. However, all this is 
guess-work. 

Graunt did not know the ages of the dead; what he did was to pick out 
of the list of causes of deaths those which he thought lighted only upon children 
‘not more than four or five years old’. He chose Thrush, Convulsions, Rickets, 
Teeth and Worms, Abortives, Clirysomes, Infants, Livergrown and Overlaid. 
These gave him some 70,000 out of some 229,000. Then he assigned half the 
deaths from Small Pox, Swine Pox, Measles and Worms without Convulsions 
also to children under six and reaches the final conclusion that about ‘ 36 % of 
all quick conceptions die before six years old’. 

Is this conclusion — I will not say correct, because we have no data to reach 
a correct result — but of a reasonable order of magnitude! The answer is that 
it is eminently reasonable. Two hunch-ed years after Graunt’s death, William 
Parr printed (in the famous Supplement to the 35<ii Annual Report of the Registrar- 
General, p. oxxxvi) an outline Life Table for London. This was, of co\irse, com- 
puted by an approximately correct method, using knowledge of the numbers 
and ages of the living population, and reflects the conditions of seventy-five 
years ago. Interpolating in this we find that about 32% of ‘quick conceptions 
died before six years old’. There is no good medical I’eason for holding that the 
conditions of child life in London in the middle of Victoria’s reign were much 
better than in the seventeenth century. The old genius used a bow with a frayed 
string and made no allowance for windage, but his arrow hit the target not far 
from the white. He gave the first quantitative measure of the Herodian sacrifice 
in towns, a sacrifice which was to continue to be offered for more than. 200 years. 

Graunt then passed to the other end of life and found that 7 % of the dead 
were ‘ aged’. He conceived that the searchers would mean by ‘ aged’ persons of 
70 years or upwards, ‘for no man he said to die properly of Age who is much 
less’. His following suggestion that the proportion living beyond 70 might be 
used as a measure of healthfulness is not happy. But this calculation may have 
led him to make, or insert, the most famous passage in his book, viz. what is, 
in form, the first Life Table ever published. 

Whereas we have found, that of 100 quick Conceptions about 36 of them die before 
they be six years old, and that perhaps but one surviveth 76; we having seven decads 
between six and 76, we sought six mean proportional numbers between 64, the remainder, 
living at six years, and the one, which survives 76, and find that the numbers following 
are practically near enough to the truth ; for men do not die in exact proportions, nor in 
fractions, from whence arise this Table following (386), 

Graunt’s figures are 100, 64, 40, 25, 16, 10, 6, 3, 1. 



208 Medical statistics from Oraunt to Farr 

The one snrvivor to 76 iSj as Graunt implies, a guess ; perhaps he conjectured 
that his seven survivors beyond 70 died one a year. How he calculated his mean 
proportional numbers is unknown. Prof. Willeox conjectured that he experi- 
mented with multipliers of 5/8 and 2/3— the former nearly reproduces the figures 
(see Willeox, Revue de Vlnst. Intern, de Statistigue, 5 (1937), 327). Ptoukha 
{Oongres Intern, de la Population', Demographie historique, p. 71, Paris, 1937) 
ingeniously suggests that he used the multiplier (Of — 1)/100 or 0'63. 

We must, I fear, conclude sorrowfully that this shot did not find the bull’s 
eye. If Graunt’s survivors are compared with those shown in Halley’s table 
(when correctly used, vide infra), for 100, 64, 40, 25, 16, 10, 6, 3, 1, we should 
have 100, 56, 50, 46, 38, 31, 22, 14, 6. It is possible that child mortality was 
lower in London than in Breslau, but quite incredible that later age mortality 
should have been so enormously higher. 

But, of course, having regard to the data, it would have been more than 
genius, it would have been magic, had a correct result been obtained. 

Prof. WiUcox, whose opinion of Graunt is almost as high as mine, regards the 
passage as inserted on the recommendation of Petty and as Petty’s composition. 
He thinks that it lacks Graunt’s caution and suggests the flighty ingenuity of 
his friend. Prof. Willcox’s arguments are weighty, but I am not convinced. 
That Graunt did not — ^to use the expressive slang — feature his table is true. It is 
also true {vide supra) that passages in Petty’s undoubted writings imply that 
he had some conception of a survivorship table. But — and this is my main 
difficulty — if this were Petty’s idea, I find it difficult to believe that he would 
not have exploited it. Halley, whose economic scent was not so keen as Petty’s, 
saw the epoch-making importance of an idea which was to transform the business 
of selling annuities. It would be odd if Petty had seen it that he did not comment 
upon it. Graunt might well have hesitated, being a cautious statistician, but 
surely not Petty. 

However, in spite of modern practice, the writing of history wholly in terms 
of psychology has its pitfalls. 

Let us return to simpler appheations of shop arithmetic. The advantages of 
country life over town life from the point of view of both mortality and morality 
had been a commonplace of poets, particularly those Roman poets who spent 
much of their lives in a city, long before the seventeenth century. Graunt was 
the first to apply an arithmetical test of mortality; he compared the statistics 
of Romsey with those of London. For Romsey he had ninety years’ data of 
marriage, christenings and burials. 

His statements about the population of the parish are not quite consistent. 
In one sentence he says that it ‘both 90 years ago, and also now, consisted of 
about 2700’, but a few lines later says ‘it neither appears by the burials, 
christenings, or by the built of new housing, that the said parish is more populous 
now, than 90 years ago, by above two or 300 souls’. A little later he says ‘it is 
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clear that the said parish is increased about 300, and it is probable that 3 or four 
hundred more went to London; and it is known that about 400 went to New 
England, the Caribe Islands and Newfoundland within these last forty years ’ 
(389). Actually, from an estimate of the number of communicants (which he 
assumes to be rather more than half the total population) he makes the average 
population between 2700 and 2800. Taking the average of burials for the whole 
period to be 58, this gives him a death rate of a little more than one in 60, 
which he contrasts with the London figure of one in 32 (apparently based on his 
count of 11 families with 88 persons amongst whom 3 deaths occurred in a year; 
but this is a rate of one in 29). 

There is no doubt a certain sketchiness about this , but it was not unreasonable 
to infer that the Romsey rate was much lower than the London rate. 

Graunt found that, unlike London, Romsey had an average excess of 
christenings over burials, they were in the ratio of 6 to 4. He estimates that over 
the period the natural increase was 1059, and, as will be seen from the quotation 
made, he allots about a third of this respectively to London, to the colonies and 
to the parish itself. He argues that supposing the population of all England to 
be fourteen times that of London and other parishes to send one-third of their 
natural increase to London, then the London burials should increase about 
200 per annum ‘and will answer the increase we observe’. 

Here again the argument is reasonable. He goes on to an investigation which 
has been severely criticized. He gives a table of the greatest and least number 
of burials in each of the ten-year periods for which he has data. In each decade 
but one the maximum is more than twice the minimum. But, he remarks, in 
no decade in the London experience is the largest number of burials twice the 
smallest number (he excludes deaths from plague from his statistics). ‘Which 
shews, that the opener and freer airs are most subject both to the good and bad 
impressions, and that the fumes, steams and stenches of London do so medicate 
and impregnate the air about it, that it becomes capable of little more, as 
if the said fumes rising out of London met with, opposed and jostled back- 
wards the influences falling from above, or resisted the incursion of the country 
airs’ (392). 

Prof. Hull shook his head over this passage. ‘This is an attempt to explain 
by physical conditions the wide range in the observed country death rate which 
is really due to the narrowness of the field — a single market town — under 
investigation. It is perhaps the gravest statistical mistake that can be charged 
against Graunt’ (Ixxvii). 

I do not like to leave a hero in the lurch. I must concede that if both Romsey 
and Loudon burials were samples from a Poisson universe, the fact that the 
Poisson parameter for London was at least a hundred times that for Romsey 
would make it incredible that the London range, in terms of the mean or of the 
standard deviation, should he so wide as that for Romsey. But Prof. Hull was 
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wrong in supposing that the wide range in the Bomsey rates was due to the 
narrowness of the field of observation in a statistical sense. 

Taking Graunt’s 68 as the ‘expected’ annual deaths then, as 1/58 is smaU, 
the Poisson distribution is not far from the symmetry of a normal curve, and 
using the results of Tippett and E. S. Pearson, we may conclude that the 
expected range would be 23*4:5 ± 6*073. The observed ranges for the successive 
decades are 32, 48, 78, 23, 65, 39, 121, 91, 52. All but one is greater than the 
expectation and six diverge by more than three times the standard error. 

Something more than small numbers is involved. Still, it must be confessed 
that Graunt did not anticipate the reasoning of James Bernoulli, although an 
intuition of genius may have led him to think that something more than ‘ chance ’ 
had play here. 

Graunt devoted special attention to the demographic influence of the plague. 
In the first place, he remarks that the attribution plague understated the 
mortality due to plague. He infers this from the fact that in plague years burials 
from other causes exceeded the average greatly, ‘from whence we may probably 
suspect, that about 1/4 part more died of the plague than are returned for such’. 
Next he inferred that after a great outburst plague lingered for several years. 

The plague of 1630 lasted twelve years, in eight whereof there died 2000 per annum 
one with another, and never under 300. The which shows that the contagion of the plague 
depends more upon the disposition of the air than upon the effluvia from the bodies of 
men. "Which also we prove by the sudden jmnps which the plague hath made, leaping in 
one week from 118 to 927 ; and back again from 903 to 268; and from thence again the very 
next week to 852. The which effects must surely be rather attributed to change of the a.ir, 
than of the constitutions of men’s bodies, otherwise than as this depends upon that (360). 

Finally, he observes that within two years the city was re-peopled; a deduction 
from the time taken for the number of christenings to reach again the level of 
a pre-plague year. 

We may, if we please, smile at Graunt’s epidemiological inference. But it is 
a reasonable inference from the facts when we remember that in Graunt’s day — 
in spite of Fracastorius — contagium was not thought of as contagium vivum, 
but as a mere sympathetic vibration or passing on of something. 

I have, I hope, given an adequate sample of Graunt’s quality, but have not 
mentioned the most famous of aU his deductions. Both in London and the 
country, on the average more males were christened than females, but more 
males died young or entered celibate occupations. So we reach this conclusion; 

We have hitherto said, there are more males than females ; we say next that the one 
exceed the other hy about a thirteenth part. So that although more men die violent deaths 
than women, that is, more are slain in wars, killed hy niischanoe, drowned at sea and die 
by the hand of justice ; moreover more men go to colonies and travel into foreign pai*ta 
than women; and lastly, more remain unmarried than of women as fellows of colleges, 
and apprentices above eighteen etc. yet the said thirteenth part difference bringeth the 
business but to such a pass, that every woman may have an husband, without the allowance 
of polygamy (376). 
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The story of how this arithmetical justification of God’s providence attracted 
the attention of Derham, of how Derham’s book fired the enthusiasm of the 
Prussian Army chaplain Johann Peter Siissrailch and of how Sussmiloh’s book 
influenced Malthus has been weU told by Hull. I should not myself rank this 
section high among Graunt’s researches. From a demographic point of view 
neither judicial hanginge nor college fellowships could have had much effect in 
reducing the male excess. 

Even copious quotation fails to convey the spirit of a complete book. I have 
quoted good things, but many more remain. Graunt revealed sundry important 
truths and not the least important was that very imperfect data, if patiently 
considered, will teU us something it is good for us to know. If young medical 
ofiBoers going to parts of the empire where organized medical and demographical 
information is at no higher a level than that of seventeenth-century England — 
and there are many such places — were restricted to a single book on statistics, 
I should advise them to take not a modern scientific work, but old John Graunt’s 
Observations. 


Appendix 

Did Qraunt lorite the book published over his name'? 

John Graunt and William Petty were, as we have seen, close friends. Graunt, 
as the world judges success, failed and Petty succeeded. But by the judgment 
of scientific men in the seventeenth century, and ever since, the order of intel- 
lectual precedence was reversed. From the moment of publication a few 
discerning people perceived the originality and importance of the Observations, 
the same people who, while admiring Petty’s verve, ingenuity and worldly 
success, did not take over-seriously his bright ideas. 

But Graunt was a man of one book. Save a note upon the multiplication of 
carp and the growth of salmon, he published nothing more. Petty went on 
writing, scheming and talking for thirteen years after Graunt’s death. That often 
enough in that period, the Observations were discussed over the wine — as they 
were quoted in Petty’s writings — ^we may suppose. That Graunt discussed his 
work with Petty both before and after publication we may also take for certain, 
although we have no formal proof of it. The country statistics which Graunt 
first used were from Petty’s native parish and even if we are not disposed — as 
certainly I am not — ^to give much weight to particular turns of phraseology, stili 
there are sufficient verbal oddities in some pages of Graunt’s book to suggest 
Petty’s hand. 

In these circumstances, it would not be very surprising if Petty’s associates, 
particularly those who were not good judges of statistical work, were to conclude 
that Petty’s share in the remarkable achievement of Graunt were greater than 



212 Medical statistics from Qraunt to Farr 

appeared. It is not even judging Petty too harshly to suppose that he himself 
might come to share the opinion. There is no evidence that Petty ever did 
explicitly claim the credit. In one list of his writings (one of four), found among 
Petty’s Papers, he did include the Observations, which at least is evidence that 
he thought himself entitled to a share of the credit. We may suppose that if, 
in familiar intercourse, somebody had said ‘Come, confess Sir Wdliam, yours 
was the hand that guided the pen of poor John Graunt’, he might not have 
denied it very strenuously. I think I have produced evidence enough that Petty 
did not mider-rate his powers and was not conspicuous for delicacy of feeling. 
My guess would he that long before his death he did come to believe that 
Graunt’s intellectual success was due to his help. 

Whether Petty believed this or not, it is ceiiiain that friends and associates 
of Petty began to believe it soon after Graunt’s death, and the belief has been 
entertained by a few people in each generation since. These, with one con- 
spiouous exception, have been drawn from Petty’s friends or descendants or 
from literary critics. 

In the seventeenth century, of Petty’s circle, Evelyn, Southwell and Aubrey 
believed or said that Petty wrote or inspired Graunt’s book. Two Pellows of the 
Royal Society, Houghton and Halley, also attributed the book to Petty. The 
only one of the five who was certainly a competent judge of scientific merit was 
Halley. Halley began the memoir which contains his Breslau table with these 
words : 

The contemplation of the mortality of mankind has, besides the moral, its physical and 
political vises, both which have some time since been most judiciously considered by the 
curious Sir WiUiam Petty, in his moral and political Observations upon the Bills of Mortality 
of London, owned by Captain John Graunt. And since in a like treatise on the Bills of 
Mortality of Dublin. . . . But the deductions from those bills of mortality seemed even to 
their authors [aic] to be defective. (Phil. Trans, no. 196 (1693), p. 696.) 

Since the seventeenth century, there has been unanimity among demo- 
graphic atatisticians and economists that Petty could not have written Graunt’s 
book. Halley was quite as good a judge of scientific merit as any of them and 
a contemporary of the canvassed writers ; if I were sure that he had read and 
compared Petty’s acknowledged works with the Observations I should prefer his 
opinion to that of other ‘experts’ — ^including, of course, my own. Halley’s 
direct testimony, in the sense of a court of law, would be valueless ; he was only 
six years old when the Observations were published and became a Fellow of the 
Royal Society five years after the death of Graunt. There is no evidence that 
either before or after the period of writing and publishing his famous memoir, 
Halley worked on demography. After his memoir, but in his lifetime, a new 
epoch in mathematical vital statistics began. De Moivre, eleven years younger 
than Halley, brought out his principal works in the lifetime of Halley (1656- 
1742) and used Halley’s table. The two men must have been well acquainted, 
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for both were enthusiastic disciples and intimate friends of Newton, but Halley, 
like Graunt, made only one contribution to the literature of demography. 

So it may be doubted whether Halley were sufficiently interested in demo- 
graphic or economic writings to have read Petty’s tracts at all. Also in the 
passage cited above (apart from the writing of ‘ authors ’ not ‘ author ’) the collo- 
cation of the Observatims on the Dublin Bills with those on the London Bills is 
curious. There is no doubt that the Observations on the Dublin Bills were the 
work of Petty, and in the first edition of thern they are stated on the title-page 
to be by the ‘Observator on the London Bills of Mortality’, But this, as Prof. 
Hull pointed out (xlii), was probably a catch-penny device of the publisher, 
Mark Pardoe, to draw a public which had just taken a fifth edition of the London 
Observations. Actually the book did not sell, and when the publisher reissued an 
enlarged version, Petty’s name appeared on the title-page without any reference 
to the London Observations. I conclude that Halley’s evidence is less weighty 
than it seemed. He may well have had before him copies of Graunt’s book and 
of the two editions of the Dublin Observations. Having no other knowledge of 
the literatui'e he would naturally enough write as he did. 

If we eliminate HaUey, no other expert countenanced Petty’s authorship and 
one, Augustus de Morgan, gave an amusing but quite cogent reason for dis- 
missing the notion. 

In speaking of the variations in the annual numbers of deaths attributed to 
Rickets, Graunt said : 

Now, such back-starting seem to be universal in all things ; for we do not only see in the 
progressive motion of wheels of watches, and in the rowing of boats, that there is a little 
starting or jerking backwards between every step forwards, but also (if I am not much 
deceived) there appeared the like in the motion of the moon, which in the long telescopes 
at Gresham College one may sensibly discern (368). 

De Morgan {Budget of Paradoxes, 68; Assurance Magazine, 8, 167) commented 
on the improbability that ‘that excellent machinist. Sir William Petty, who 
passed his day among the astronomers’ would attribute to the motion of the 
moon in her orbit all the tremors which she gets from a shaky telescope. 

Down to 1927 the matter was regarded, in scientific circles, as settled. In 
that year the late Marquis of Lansdowne pubhshed a copious selection of the 
Petty Papers with what he regarded as new evidence in favour of Petty. 

The only new evidence of a direct kind was a manuscript list in Petty’s hand 
of his writings or projected writings which included the Observations. There are 
three other lists which do not include the Observations, and if we are to suppose 
that the entry really referred to the book published under Graunt’s name, then 
we must believe that in 1685 and in 1686 Petty had forgotten his best title to 
scientific immortality. The remainder of the evidence consists of parallel passages 
and ad captandum arguments to the effect that it was more probable that a 
physician had written on questions of medical statistics than a tradesman. This 



214 


Medical statistics from Graunt to Farr 

publication led to a lively controversy. Of the merits of this, I, as a party to it, 
am not an impartial judge. Purely literary arguments do not appeal to me when 
the question is of scientific method. Thus, Dr L. F. Powell attached Weight to 
the fact that Dr Johnson in conversation had attributed to Petty an observation 
(not statistical) which is made not in Petty’s writings but in Graunt’s book. 

In the discussion the word ‘ style ’ is used in different senses by the combatants . 
The statisticians are thinking of scientific method, the literary critics of verbal 
arrangement. To the former the fact that, particularly in the conclusions and 
the Appendix, Graunt’s book has turns of phraseology which suggest Petty’s 
hand, seems of httle importance. To the latter it seems very significant. 

In the article by Prof. WiUcox, which I have quoted above, the controversy 
is reviewed, and the author concurs generally with his statistical predecessors. 

Prof. Willcox does, however, differ from his predecessors in one important 
particular. He holds that the famous life table was supplied by Petty. He argues 
that this is far too conjectural to have been the work of so cautious a reasoner 
as Graunt : 

la attempting to reconstruct its origin I have surmised that after Graunt had estimated 
that 36 per cent, of the deaths were due to children’s diseases, that they all occurred under 
the age of six, and that the seven per cent, who were reported to have died ‘aged’ died at 
over 70 years of age (at one place he says over sixty), he felt unable to go further and 
reported his difficulty to Petty, already perhaps speculating about a series of similar 
problems. 

Petty guessed at the number of survivors at the end of each decennial age period, 
6-16, 16-26 etc. incidentally and characteristically ignoring Graunt’s theory that seven 
per cent, survived seventy, and assuming instead, without reason, that one per cent, survived 
seventy-six and not ono per cent, eighty -six, and that the survivors at age six decreased with 
each age period in a geometrical progression approximately equal to the 64 per cent, which 
Graunt had set for the first group (326-7). 

Prof. WiUcox’s argument is cogent. It may be strengthened by a criticism 
of the late Prof. Westergaard [Contributions to the, History of Statistics (London, 
1932), p. 23). In using this table, Graunt made a serious blunder. In order to 
estimate the number of men of military age in London he subtracted the number 
alive at age 66 from the number alive at age 16. But this simply gives him the 
number dying between those ages ; what he wanted was some average of the l^j’s . It 
is evident that Graunt was not at aU clear in his mind as to how to use a life table. 

On the other hand, if this table were really Petty’s idea, it is hard to under- 
stand why he did not exploit it. If Petty had been a Halley, the explanation 
would he obvious. The table is wi’ong; the conditions for the validity of the 
method were not fulfilled. There is indeed [vide supra) some evidence that Petty 
did know what data were necessary in order to construct a proper life table. 
One seems on the horns of a dilemma. If Petty thought the table was correct 
why did he make no further use of the method? If he thought it was wrong, 
would he have urged Graunt to insert it? 
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Although Prof. WiUcox has certainly shaken my previous conviction, I still 
feel reluctant to surrender Graunt’s table to Petty. However, there may be an 
element of sentimentality in this. At least the statisticians agree that the answer 
to the question which I have placed at the- head of this Appendix is emphatically 
yes. 

Refeebn-ces to the recent controversy in oheonologioal order 

1927. Lansdownb, Marquis of. The Petty Papers. 

1928. Gebknwood, M. J.R. Smtiat. Soo. 91, 79. 

1928. Lansdowitb, Marquis of. The Petty Southwell Correspondence, pp. xxiii— xxxii. 

1932. Lansdownb, Marquis of. The Times Literary Supplement, 8 Sept. 

1932. Bbbtt-Jambs, N. G. The Times Literary Supplement, 16 Sept. 

1932. Gkebnwood, M. The Times Literary Supplement, 22 Sept. 

1932. Lansdowne, Marquis of. The Times Literary Supplement, 13 Oct. 

1932. Po-WEEL, L. P. The Times Literary Supplement, 20 Oct. 

1933. Gbebnwood, M. J. R. Statist. Soo. 96, 76. 

1937. Wrtxoox, W. P. Revue de ITnst. Intern, de Statistique, 5, 321. 

IV. HALLEY’S LIFE TABLE 

The long and fruitful life of Edmund Halley (1656-1742) belongs to the 
general history of science ; of him it may indeed be said nihil quod tetigit non 
ornavit.- He made only one contribution to our subject, but it was of first-rate 
importance. 

The circumstances of this tmdertaking are obscure; Hajley would have 
perceived the imperfections of Graunt’s life table, but it is not known whether 
it was he who set on foot a search for better statistical material than Graunt 
had had. Inquiries were however made, and made after he.had become a Fellow 
of the Royal Society, so it is at least possible that Halley, who had travelled 
extensively in Europe (he was at Danzig in 1679 and in Italy in 1681), suggested 
that something might be found abroad. By 1691, th^'King’s Librarian, Henry 
Justel, who was in touch with the Society, had been l^bught into communication, 
possibly through Leibniz, -with Caspar Neumann, 4 scientifically minded evan- 
gelical pastor of Breslau. Neumann supplied the ^Rta which HaUey used. 

In 1883, J. Graetzer, a medical-statistical ^oial of Breslau, published a 
little monograph* which throws light upon the -v^rk. He not only extracted firom 
the Breslau archives aU the data which were opnight have been communicated 
to the Royal Society but had the Society’s ©chives searched, with the. result 
that a letter from Neumann to Justel and jlnother firom Neumarm to HaUey, 
both with statistical appendices, were di^overed. Thanks to the labours of 
Graetzer and an essay by R. Bbckh {BulMtn de Vlnst. Intern, de Statistique, 7 
(1893), 1) we can form a reasonably cle#idea of HaUey’s method, which was 
not what those who have not examine^he literature suppose it to have been. 

It is often stated that HaUey, havlg found that during the five years of 
observation the number of births onlypghtly exceeded the number of burials, 
♦ Graetzer, J., Edmund Ha^ und Caspar Neumann, 1883. 
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treated the population as stationary and constructed a life table by a simple 
summation of the deaths in the manner already explained. He was much wiser. 
What he tried to do was to construct a papulation table, in the following way. 
Suppose we know how many children were born in a calendar year, say 1690, 
in a town not subject to migration which maintains accurate registers of ages 
at death, and then we discover how many of the children horn in 1690 will be 
alive on each successive first of January by a series of subtractions. We shall 
have the survivors on 1 January 1691 by subtracting those of the children born 
in 1690 who died in 1690. We shall have the survivors on 1 January 1692 by 
subtracting the deaths in 1691 occurring among the survivors to 1 January 1691, 
and so on. This wiU give a precise enumeration of the living population, and this 
is what Halley wanted. The figures we shah obtain wih not be the conventional 
Ig-’s of a Life or (as German writers say) Mortality Table, but what in most 
modern books are represented by the capital letter L or years of life lived or 
“persons” living between the termini (see Appendix). Jf the population is 
stationary, the sum of these figures gives the population of the place under 
study. Now for ages between 1 (last birthday) and very advanced ages, is 
simply 1^ diminished by | (Ij,- l^j+i). In the first year of life (and at advanced 
ages) the difierence is greater. Thus in the first year of life deaths are not evenly 
distributed throughout the year of life, more than 70% of them occur in the 
first six months-..pf life, so that instead of subtracting half the deaths we must 
subtract nearly tbree-quarters. Halley himself assigned 68 % of deaths in the 
first year of life to the first half of the first year. The reason why Halley proceeded 
in this way was that .he knew the population not to be stationary. His idea was 
to obtain the figures l^or the first few years of life accurately — indeed just as 
they are now obtained-^^and then to correct for excess of births over deaths. 

His masterly plan was partly defeated by the fact that his Breslau corre- 
spondent Neumann was n# so good a statistician as he was. Halley’s letter to 
Neumann has not been preserved, we have only Neumann’s answer of 1 March 
1694. Probably Halley aske^^Neuinann to send him (as a check on the calcula- 
tions he had already made) tKie exact numbers of survivors on 1 January for 
five years, of births in a calendar year. Neumann did send him a table, but the 
table, as Graetzer pointed out, i\ wrong. Neumann gave the correct figures for 
1 January of the first sucoessive\ year, incorrect figures for the other years. 
Between 1 January of the year following the year the births of which are under 
study, and the next first of January ,\8ome of those horn in the starting year will 
die under and some over a year of ame- Neumann merely deducted the former, 
so he has too many survivors. To reachlithe right figure would have meant taking 
more trouble and he did not appreciate the importance of this. Bockh — ^whose 
opinion of his statistical contemporarieslhas a tinge of bitterness rare, of course, 
in other scientific pursuits — ^remarks thA-t it was not strange Neumann should 
miss the point as it had been missed by |many statisticians long after his time. 
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s at least clear that Halley had realized an. important truth which did not 
ome part of even expert knowledge for more than a century. 

The precise arithmetical detaha of Halley’s work are not perhaps of much 
iical interest. Graetzer and Bookh have done a good deal to clear it up. The 
a used (an average of five years) had 1238 births and 1174 deaths and the 
le accounts for 1238 deaths. Halley must therefore have had a plan for 
reusing deaths. It is likely, from an observation he makes on mortality in 
■ist’s Hospital, that he did not wholly depend on the Breslau figures. Graetzer 
gests that HaUey may have made two graphs, one having an ordinate of 1238 
she origin and an ordinate of 64 at the oldest age, the other an ordinate of 
4 at the origin and 0 at the oldest age and that he plotted the survivors for 
h graph based on recorded deaths and drew a curve passing through 1238 
L 0 between these graphs. It may be so. Using the original material which 
tetzer published, Bockh recalculated the table. The results do not, except at 
s over 60, differ materially from Halley’s. So far as concerns the mean after 
time (expectation of life), Halley’s table gives 27-54 years at birth, Graetzer ’s 
terial 27-69. For ages under 40 the re-working gives slightly lower and for 
}r ages higher mortality. It may be noted that Halley’s table gives appre- 
fiy higher mortality in childhood than Graunt’s, more than 43 % instead of 
/o are dead by the age of six years. But Graunt’s method would exaggerate 
rtality (so would Halley’s method, but, owing to his precautions, not so 
a-tly). On the other hand, Graunt’s estimate of age is only an intelligent 
ss. Actually, as Graetzer showed, the infant and child mortality shown by 
Iley’s table differed little from the observed rates of mortality in the city of 
slau in 1876-80. 

It has been said that Halley was not greatly interested in the medical aspects 
lis work. After describing methods of calculating the prices of annuities, he 
the following passage’*' : 

Et may be objected that the different salubrity of places does hinder this proposal from 
ig universal; nor can it be denied. But by the number that die being 1,174 per annum 
14,000 it does appear that about a 30th. part die yearly as Sir William Petty has 
iputed for London; and the number that die in infancy is a good argument that the air 
at indifferently sal-ubrious. So that by 'W'hat I can. learn, there cannot perhaps be one 
)er place proposed for a standard. At least ’tis desired, that in imitation hereof the 
ous in other cities -would attempt something of the same nature, than which nothing 
laps can be more useful. 

That the mortality of childhood depends upon the atmosphere is not so 
iish an hypothesis as it may seem to us. Halley lived before breast-feeding 
ame the exception rather than the rule. The ‘curious’ in other cities had not 
wit to follow his advice. He made no other contribution to the science of 
il statistics ; a gain to astronomy but a heavy loss to demography. 

*' I have read Halley’s paper in the collection of papers, many by him, collected under 
title Miscellanea Ouriosa, printed in London in 1706 ; the quotation is from p. 300. 
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Appendix 

Halley’s table is printed in two columns, the first headed ‘ Age current’, the 
second ‘ Persons ’ . Thus : 

Age current 

1 

2 

3 

4 

6 

6 

7 

8 

d 

10 

and so on. 

A mistake sometimes made is to suppose that Halley meant by age current 
simply the end of each year of life and that the entry against each ' age current ’ 
is the number of survivors at exact age one year less than the ‘age current’, 
viz. that of 1000 born 856 survived to the first anniversary, 798 to the second 
anniversary, etc. The fact that Halley uses the round number 1000 for a first 
entry does something to encourage the mistake among readers who have not 
consulted the original paper and it is sometimes made by people who should 
know better. It is actually a terrible ‘howler’, leading to a wholly false view of 
rates of mortality in early life. Thus if 1000 and 865 were really the first two 
entries of a Life Table as set out now, then, as the first two entries in English 
Life Table no. 7 Males (mortahty 1901-10) are 1000 and 856, we might conclude 
that mortality in the first year of life was no lower in 1901-10 than in Breslau 
in the last years of the seventeenth century. But the lOOQ of Halley’s table is 
not the number of new-born children but the average number out of 1238 born 
living between the ages of 0 and 1. This is what is called the of a modern 
table or the population living between the ages x and x + 1. If we have a column 
oi Lfs, which is what HaUey gives us, we can deduce therefrom the more 
familiar Ij-’s provided we know the starting value and the number of deaths in 
the first year of life. Halley gives both items. He says that of 1238 annual births 
348 die annually. So that his 1, is not 1000 but 1238, and bis Ij is 890. He chose 
1000 for Lo hy assuming that of the 348 deaths in the first year of life 238 occurred 
in the first six months of life, 68 %. This differs very little froni the modem 
practice; in Life Table no. 7 quoted above 73-5% of the deaths in the first year 
of life are assigned to the first six months of life. Having been given 1 q and 1^ 
we can deduce the other I’s firom the values of the L’s which Halley gives 
because, after the end of the first year of life there is little error in supposing 
that the deaths between two birthdays are evenly distributed over the year; 
so, for mstance, 1^ will be equal to L-^ less half. the difference between Ij and la. 


Persons 

1000 

855 

798 

760 

732 

710 

692 

680 

670 

661 
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and proceeding in this way we put Halley’s table into modern form. I attach 
a table calculated by Bockh. 

It will be seen that, if Halley’s table is properly used, the comparison is not 
of 1000 and 866 with 1000 and 856 but of 1000 and 719 with 1000 and 866. 

Actually this is still slightly optimistic, because I am comparing ‘persons’ 
with males. The ‘persons’ figure for 1901-10 is 1000, 869. On the other hand, 
in the Breslau data stfil births (or some of them) are included in births, so that 
the mortality is slightly exaggerated. If for instance 7% are still bom, the 
survivors to 1 will be the same, but the Iq should be reduced to 930. Or alter-' 
natively we should write 1000 and 773. 

I attach Halley’s table reduced to modern form and with the corresponding 
expectations of life calculated by Bockh (I have reworked some of the values 
from the data and agree with Edokh’s figures). 


Halley’s Tahle, expressed in modern form, together with the Expectations of Life 
at quinquennial intervals (Bockh) 


Age 

L 


Age 

L 

4 

0 

10,000 

27-64 

40 

3,567 

22-05 

5 

6,816 

41-47 

45 

3,167 

19-47 

10 

6,307 

40-26 

50 

2,761 

17-06 

16 

5,049 

37-19 

66 

2,319 

14-76 

20 

4,806 

33-93 

60 

1,914 

12-33 

25 

4,652 

30-69 

66 

1,611 

9-96 

30 

4,267 

27-64 

70 

1,103 

7-74 

35 

3,921 

24-78 

76 

670 

7-50 


V. GUESSING THE POPULATION 


My object is to trace the growth in our country of that part of statistical 
science which is of interest to students of medicine or public health. In speaking 
of such pioneers as Graunt, Petty and Halley it was proper to construe the 
obligation rather freely. Both Graunt and Petty did clearly perceive the relevance 
of their researches to matters of public health or even clinical medicine, but much 
of what petty did had a more direct bearing upon political questions than those 
of public health. Again, the life table is a way of expressing the facts of mortality 
which is valuable in some medical researches, but its importance as a statistical 
instrument has been much greater in non-medical than medical circles, above 
all of course in the financing of assurance business. The commercial importance 
of life tables was perceived by Halley and by other mathematicians of his and 
the following generations. 

Looking at the position after Halley’s publication it was clear that progress 
might be made (1) in improving the acemaoy of the life table, viz. by obtaining 
data more relevant to the conditions of life of persons who assured their lives 
or bought annuities, (2) in simplifying the very laborious calculations which the 

Biometrika xxxii 15 
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determination of praemia or purchase values required. Under (1) no progress 
worth, speaking of was made in England until the end of the eighteenth century. 
This was partly due, as we shall see, to a not entirely unjustified disbelief in the 
powers of the medical profession to change the rate of mortality, partly to 
ignorance. No first-rate English mathematician after Halley gave any critical 
attention to the theory of the Life Table before the nineteenth century. Under 
(2) considerable progress was made, but this progress is of little or no medical 
interest and to describe it would involve entering upon tedious arithmetical and 
algebraical detail. The primary medical-statistical quaesita are correct enumera- 
tions of deaths by sex at ages and by causes, and of the numbers living in sex 
and age groups. When these have been satisfied, the medical statistician can get 
to work. 

For 160 years after Graunt ’s death very little was done to improve matters. 
Down to 1801 the population as a whole had not been counted; forty years more 
passed before a reasonable age distribution was secured, and it was thirty-eight 
years after the first denominator (populations) that the first numerator (deaths) 
of the fundamental fractions was. obtained-. Until 1801 intelligent guessing Was 
the method and the guesses of the eighteenth century deserve a few pages, if 
only because they prove that statistical ability is as rare as other kinds of ability 
and that wishful thinking is not a modern foible. 

The first estimator to mention belongs to the seventeenth century and was 
a younger contemporary of Graunt and Petty, Gregory King (1648-1712). He 
was bom in Lichfield, the son of a land surveyor. At the age of fourteen he was 
recommended as a clerk to the famous herald Dugdale with whom he worked 
for several years ; after Dugdale had finished his Visitation, King worked for 
various amateur antiquaries and was eventually invited by a lady of property 
in Sandon (Staffordshire) to be her steward, auditor and secretary. Here he 
remained until 1672 when he moved to London and, no doubt through Dugdale’s 
recommendation, had a considerable amount of employment in both heraldic 
work and ordinary surveying. In 1677 he became a member of the College of 
Arms, in which he attained the rank of Lancaster Herald and so continued until 
his death, but worked for other official bodies on financial subjects. 

The decorous memoir by George Chalmers, from which I have extracted 
these particulars, does not give us a very life-like picture of the man himself. 
There is a certain likeness between the early careers of Petty and King. King 
was not indeed shipped as a cabin boy, but Mr King (the elder) drank (if we may 
venture so coarse an abbreviation of Chalmers’s statement that the father studied 
and practised his profession ‘with more attention to good fellowship than 
mathematical studies generally allow’) and King juixior was a pupil teacher at 
eleven. If he really read Hesiod and Homer, made Greek verses and taught 
himself to survey land in his thirteenth year he must have had Petty’s precocity. 
Both Petty and King had experience of practical surveying and, of course, both 
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were interested in political arithmetic. But there the parallel ends. King was 
a professional surveyor and archivist and had a reasonably successful professional 
career. Petty was — Petty. One might, perhaps, adduce as another parallel that 
King made some enemies and thought himself iU-u&ed. But the job by which 
Sir John Vanbrugh, a stranger to the College of Arms, was made a king-at-arms 
over the head of an official of twenty years’ standing would have galled the 
meekest of mankind. One may safely conclude that King had more knowledge 
of the data of political arithmetic than Petty and less originality. His vital 
statistical work was not published until nearly a century after his death, as an 
appendix to the second edition of An Estimate of the Comparative Strength of 
Great Britain by George Chalmers, London, 1803. Perhaps he never intended 
to publish it — ^he communicated the substance to his contemporary Davenant — 
and this may explain why there are no details of how some of his results were 
reached. The report reads rather like a document prepared for official use by 
persons interested in results not methods. 

The starting-point of King’s attempt to estimate the population was a return 
from the Hearth Office of the number of houses assessed to tax on Lady Day, 
1600. That was 1,319,215 which, King estimated, had increased to 1,326,000 by 
1695. He deducted 30,000 for empty divided houses,* took the round figure of 
1,300,000 and assigned 105,000 to the London area, 195,000 to other cities and 
market towns and 1,000,000 to villages and hamlets. He used a series of multi- 
pliers, 6-4 for a house within the walls of London, 4-6 for a house within the 
liberties, 4-4 for the out parishes in Surrey and Middlesex and 4- 3 for Westminster. 
For other towns, his multiplier was 4-3 and for villages 4-0. 

Having performed his multiphcations he gives London a bonus of 10%, 
other towns 2% and villages 1%. Lastly he estimates homeless people to 
number 80,000. The final result to the nearest round number is millions. 

How King obtained his multiplier is not clear. In addition to the Hearth 
Office data he says he used ‘the assessments on marriages, births, and burials, 
parish registers and other public accounts ’ and that from these he deduced the 
multipliers, but this is rather vague. He also classified the population by sex, 
civil state and age (under 1, under 5, imder 10, under 16, above 16, above 21, 
above 25, above 60). How he reached these figures is not explained. 

But nothing succeeds like success. As we shall see, his estimate of the total 

* Prof; E. 0. K. Gonner (J.R. Statist. Soc. 76, (1912-13), 261-97), in an interesting paper which 
I have largely vised in writing this chapter, remarks that the ‘howses’ of the Hearth Office must 
have been really famDies or separate occupations as King indeed realized, and thinks that King 
fell into some confusion in attempting to replace families by houses. Gonner argued that the best 
way was to proceed on the basis that the Hearth Office unit of a family should be retained and, 
be corrected for empty houses, blacksmiths' shops, etc. on the basis of 1801 census returns and the 
multiplier used should be persons per family of 1801. The result is to give a figure about a quarter 
of a million larger than King’s. The method described in the text also leads to the conclusion that 
King somewhat understated the population. 
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population is probably very near the truth and Prof. Westergaard has remarked 
that, judging from Swedish observations of a few years later, King’s age distri- 
bution is quite reasonable. 

As a statistical prophet King was no more successful than his contemporaries 
(and successors). He believed that down to his time the population of England 
had doubled in 436 years, that the next doubling would require from 1200 to 
1300 years and that in a.d. 3600-3600 the population would reach 22 millions 
of souls, in case, as he cautiously adds 'the world should last so long’. His 
estimates as a matter of arithmetical curiosity are excellently fitted by a logistic 
with an upper asymptote of fifteen millions and would give the present popula- 
tion as about eight millions. 

Modern statisticians, such as Farr and Brownlee, have confirmed King’s 
estimate of the population at the end of the seventeenth century in the following 
way. After 1801 the population was known by actual counting and for the first 
forty years of the nineteenth century baptisms and burials were still the only 
data of bh’ths and deaths. If one started from, say, the enumeration of 1831 
and worked back to the population of 1821 by adding the numbers of burials 
and subtracting the number of baptisms then, if these really measured deaths 
and births, the result ought to agree with the census enumeration, provided 
immigration and emigration balanced. But the burials and baptisms under- 
stated deaths and births. One might adjust the figures by multipliers to bring 
the result into agreement with the census and then test against another backward 
run of ten years. Brownlee found that if the number of burials were multiplied 
by 1-2 and the number of baptisms by 1*243, the agreement was good. 

This may seem a highly conjectural method, but it certainly gives quite good 
results. The difference between births and deaths estimated in this way for the 
decennium 1801-10, I find to be about 12-4 per 1000 living. If one multiplies 
the enumerated population of 1801 by (l-Ol24)io we reach lO-l millions, not a 
had approximation to 10-2 millions actually counted. Assuming that before 
1801 burials and baptisms had the same relation to deaths and births as between 
1801 and 1841, we can work backwards to the beginning of the eighteenth 
century with the result that the population then was about 6-8 millions, not 
much more than King’s estimate. In view of the following discussion it will be 
useful to consider the probable state of the population (as determined by these 
methods) in the eighteenth century. In the first sixty years of the centwy it 
grew very slowly, was about 6-1 millions in 1761 and 6-8 millions in 1761. It 
then began to increase faster, was 7-6 in 1781, 8*2 by 179! and 9*2 at the census 
of 1801 (8-9 as enumerated, but an estimate of a deficit of l/30th was made).. 

From Gregory Bang’s time to the census of 1801 we have a series of more or 
less intelligent guesses. 

These are well described in Prof. Gonner’s paper."*! Two schools of thought 
* J.E. Statist. Soc. 76 ( 1912 - 13 ), 261 - 06 . 
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did battle in the eighteenth century ; the pessimists who held that the population 
was decreasing and the country going steadily to the dogs, and the optimists 
who believed just the contrary. Both used the same weapons. The heavy artillery 
was a return of houses for taxation purposes increased conjeoturally by a figure 
for houses which escaped taxation, the sum multiplied by a conjectural average 
of persons per house. As light artillery one had the yield of taxes on commodities 
and the returns of baptisms and burials. 

A pessimist put the number of untaxed houses low and the multiplier low, 
and an optimist raised both. 

The first controversy which took place in 1754 in the proceedings of the Koyal 
Society did not attract much notice. Brackenridge (mildly pessimistic) pointed 
out that the number of houses assessed to house tax had decreased between 1710 
and 1754 from 729,048 to 690,000, which suggested a decrease of population 
(by a previous conjectural calculation based on burials and baptisms, he had 
reckoned a small increase, which was probably correct). Much turned on the 
number of houses which did not pay tax (either because the occupant was in 
receipt of alms, did not, owing to poverty, contribute to the church or poor rate, 
or through mere default). Brackenridge put the number at 200,000. His critic, 
Forster, argued that Brackenridge under-stated the number of untaxed houses, 
adducing a sample of nine country parishes with 688 houses of which only 177 
were taxed and a market town with 229 taxed houses out of 448. Using these 
figures as a basis for conjecture Forster raises Brackenridge’s 890,000to 1,427,110. 
From this (with a multiplier of 6 for town houses and 6 for country houses) he 
reaches a population of seven and a half millions — probably a considerable over- 
estimate. 

The next controversy was a quarter of a century later (in a period when the 
population was certainly increasing) and its originator was Dr Richard Price 
(1723-91), who has attained a posthumous celebrity reminiscent of the man 
whose title to distinction was that he had once been kicked by George IV. Most 
readers know him as the preacher of a sermon which was the text of Burke’s 
Beflections, most students of economic history know him as the inventor of that 
theory of the virtue of a Sinking Fund which has been likened to the economic 
system of a community which prospered by taking in one another’s washing; 
most vital statisticians remember him as the eomputor of the Northampton 
Life Table which gave a seriously incorrect picture of prevailing mortality and in- 
directly cost the country a large sum of money. Finally, in the controversy about 
to be described. Price was pertinaciously in the wrong on all the main issues. 

The apparent inference from all this is that Price was either a fool or a knave. 
Gainsborough’s portrait of the Rev. Richard Price, which hangs (or did hang) 
in the Board Room of the Equitable Assurance Society, gives no support to the 
hypothesis that Price was a fool; his life would be a promising field of research 
for a young historian with a competent knowledge of economics. His importance 
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in statistical history is not great enough to justify me in a critical study (even 
if I had the necessary training in finance and economics). My guess is that Price 
was an able, self-confident, original-minded man, who knew a good deal about 
many things and had no exact knowledge of anything. He had 'a way’ with 
him, he could interest people. In fact he had some of the qualities of Petty. It is 
easy enough to make jokes about his notion of the mysterious power of money 
to increase at compound interest and it is possible that William Pitt the younger 
(who was only a hoy when he adopted Price’s theory) was not a good economic 
reasoner. Still, even 150 years ago, there were bankers and Treasury officials, 
and it is possible that both they (and Price) were not so much bad theoretical 
reasoners as shrewd opportunists, that they were deliberately blind to the 
speoiousness of an attractive defence of a desirable financial expedient. I have 
myself sometimes wondered whether, in the eighteenth century, an Assurance 
Society would have minded very much if a Life Table had erred on the pessimistic 
side. 

Price did not enter on the population question with an unbiased mind. He 
was a keen politician and he believed that the policy of the government was bad 
■ for the country; he also believed that the wealth of a country was its people. 
Hence he believed that the population was deolinmg and nothing shook that 
belief. Had he survived another ten years, until the first census, he would 
probably have disputed the accuracy of the returns. 

Price began with the figures of houses in 1690, which he cited from Dayenant 
(they were really due to King, who communicated them to Davenant), making 
the total 1,319,000. He then gave the figures of assessed, chai-geable and cottages 
(cottages being houses too small to be taxed) as 678,915, 25,628 and 276,149, 
making a total of 980,692 in 1761. In 1777 they were 682,077, 19,396 and 
251,261, a total of 952,734. On this basis he concluded that the population had 
declined by about one and a half millions and was actually less than five millions. 

Hewlett and Wales, Price’s chief opponents, impugned every step in the 
reasoning. First, they pointed out that in the estimate for 1690 there was almost 
certainly a confusion between families and houses. Then they argued that many 
householders evaded duty (for instance by the simple plan of blocking up 
windows (the prayer ‘Lighten our darkness, we beseech thee, Oh Pitt’ is still 
remembered) and showed by direct enumeration in certain parishes that the 
returns were inaccurate. Finally, they gave reason to think that Price’s multi- 
plier was too. small. On each of these points they were probably right. Indeed 
Price was obliged to admit the validity of some of their criticisms. But he 
declined to budge ; sometimes he took ad captandum advantage of arithmetical 
slips by his adversaries, sometimes he declined to admit that their samples were 
representative, sometimes he tried to ignore the effect of corrections which he 
was forced to make. 

These were the principal arguments. Both parties used the data of burials 



Major Greenwood 226 

and baptisms as subsidiary arguments. Price seems only to have used the London 
Bills, which rather let him down; because although they seemed to help for some 
part of the century, he admits that by 1773 London was increasing and, very 
characteristically, uses this as in his favour; ‘But it appears that, in truth, this 
is an event more to be dreaded than desired, The more London increases, the 
more the rest of the country must be deserted.’ Price’s adversaries went farther 
afield and counted burials and christenings in 162 parishes in all parts of England 
for two quinquennia, one beginning in 1758, theotherin 1773. Baptisms increased 
from 47,638 to 59,567, burials from 49,653 to 53,030. 

But neither party put much weight upon what we should now consider 
primary evidence; rightly, because of its incompleteness. 

But these data were not wholly neglected by medical writers as we shall see 
in later sections. One may fairly say on the evidence here summarized that the 
eighteenth-century political arithmeticians of England made no advance what- 
ever upon the position reached by Graunt, Petty and King. They were second- 
rate imitators of men of genius. 


{To be concluded} 
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1. Introdtjotory 

When numerous organisms or organs are weiglied, the distribution of the weights 
is often positively skew. On the other hand, the distributions of linear measure- 
ments is often very close to normal. The question then arises, given that the 
volume of an organ is proportional to the product of three mutually perpen- 
dicular measurements, each being normally distributed, what will be the dis- 
tribution of the volumes'? In general the three linear measurements will be 
correlated, and the problem might appear hopelessly complex. However, it will 
be shown later that, provided certain conditions are fulfilled, the distributions 
all lie very close together. 

The problem can obviously be generalized to cover the case of the product of 
any number of normal variates. The most interesting cases are those of two, 
three, or an infinite number of such variates. Further, two special cases are com- 
paratively simple. When the coefficients of correlation are all equal to unity and 
the coefficients of variation equal we are concerned with the distribution of a 
power of the normal variate. When the correlations all vanish, we are concerned 
with that of a product of several uncorrelated normal variates. 

It is not, of course, suggested that all skew variation of weights is to be 
explained on these lines. For example the Galton-Macalister distribution, in 
which the logarithm of the variate is normally distributed, can be thought of as 
arising in at least two different ways. The weight may be the product of a large 
number of normal variates ; or for constant cell size, the number of cell generations 
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may be distributed in a certain manner about a mean in each organ, these numbers 
being normally distributed. The highly skew variation of human weights is 
probably to be explained by the fact that a rather small fraction of the human 
race lays down very large quantities of fat. Nevertheless, it will be shown that 
simple criteria, will determine whether observed positive skewness and lepto- 
kurtosis, too large to be ascribed to sampling error, can be explained on the lines 
discussed above. And in any particular case it is worth while finding out whether 
this is so. 

Any measure of asymmetry, such as or of kurtosis, such as 72 = A 2 - 

is a dimensionless number independent of the unit of measurement. Hence in a 
transformed normal distribution of the type here considered it must clearly be 
a function of the only dimensionless number derivable from the first two moments, 
namely, the coefficient of variation, c. In what follows we shall generally use m 
for the mean of the original normal distribution and for its variance, so that 
jfci is the coefficient of variation. The usual notation is used for the mean, variance 
and other moments and cumulants of the derived distribution. 

The distribution of the product of a pair of correlated normal variates has 
already been fiiUy discussed by Craig (1936). If and Wg are the mean values 
of X and Y, fcg and their coefficients of variation, and p their coefficients of 
correlation, then Craig finds for the oumulant generating function of 


XT 


k(6) 


71 1 2p \ 2 ~| 


TOg JW2 ^ 2 ) 

2 


2[i-(i+p)d][i+(i-p)d] 


-|lQg[l-(l+p)d]-ilog[l + (l-p)(9]. 


If ki = k^ = k, this becomes 

e 


K(d)^ 


The rth cumulant of 


k[i-(i+p)e] 

XY 


■ iiog[i-(i+p)0]-iiog[i + (i-p)e3. 


1^2 •J (kx ^ 2 ) 


+ {(1 +p)-- (p_ + Kr- 1) ! [(1 +pr + (p - 1)^. 

If hx = k^ = k, the rth cumulant of AT is 

= (m^ [(1 + p)’’"^ r ! + \k{{l + p)’’ + (p - 1 )''} (»' - 1 ) !]• 

Craig’s discussion is mainly confined to the cases where kx and k^ are large. In 
the cases of greatest biometrical interest they are small, and some points of 
interest arise, besides those dealt with by Craig. 
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2. Distribution of the cube of a normal variate 

If x'bea. reduced normal variate, that is to say, a variate whose mean is zero 
and standard deviation unity, it is required to find the distribution of X®, where 
X = m(l + Bx). We note that 

0, and^=:(2r-l)(2r~3)(2r-6)...3.1 = 

Hence 

X® = w®(l + + Zhx^ + Bx^) = m®(l + 3^;), 

X® = m\ 1 + 6Bx + 1 + 2QBa? + \5iBx‘^ + QBx^ + Bx^) 

m®(-I + 1 5ib + 46P + 15P), 

so that the first four moments of X® about zero are 


/I'l = m®(l + 2lc), 

/tg = m®(l + 16A! + 45P+15i^), 

= m^(l + 36fe + 378/c2 + 1260fc3+ 946J:4), 

/*i = mi2(i + 66^; + 14g5p.,.1386OFH-51976A:H62370P+ 10395/^®). 
Hence the moments about the mean ot*(1 + Sk), and the oumulants, are 


=p'i == m®(l + 3fc), 

K^ = — 3m®i{3 + 12^ + 6&2), 


ATj == = 54m9A:2(3 + 16/S: + 15*2), 

= 27mi2*2(9 + 240*+ 1326*2+ 1920F+ 386*4), 
Kn = 648»n.42*3(7 + 48* + 76*2 + ISF). 


( 1 ) 


It is to be noted that in these calculations, and in the majority, though not 
all, those of this paper, the expressions for the moments about the mean are 
simpler than those for the moments about zero. Successive moments about the 
mean are therefore best calculated from the former, so far as possible, rather 
than the latter. Thus the equation 


Pi = A4"/*i{4/is + 6/i>2 + /*?). 
involves less algebra than the more usual 


It follows that 

(T = 3m2*l(l + 4* + |-*2)t, 
(9fc+36fc2+15*2)i 
1 + 3* 

+ 6^(3*) (3 +16* +16*2) 

(3 +12* + 5*2)1 
+ 72*(7 + 48* + 75*2 + 16*2) 
(3 + 12* + 6*2)2 
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Hence the distribution is positively skew, and leptokurtic. If c is small,' we 
have, approximately, 2c3 

ri = 2c+^+.... 

66c* 16c4 
7a = -^+-27-+ 

Since in most practical cases c* < 0*01, the first term will have an error of less 
than 0-1 %, and no attention need be paid to the later terms. 



3, Distribution ov any bower oe a normal variate 


It is required to find the moments of to”{ 1 4- The mean value of its rth 
power is clearly 

(***•) !^ I (***•) 1 1 1 

L 2{nr-.2)! 8(nf-4)l. ••• 2*.sl(nr-2s)r 

This converges for all positive values of n, since k is small, though it only 
terminates for positive integral values of nr. After somewhat tedious algebra, 
we find for the mean and other moments and cumulants: 

Xi = /ij = m"'[l + |w(r- 1)1:+ ...], 

Xg = = m}^n^k[\ + \{n- l)(3n-'6)fc+..,], 

^3 = /<a == l)P[l+^{17R*-56n + 44)A:+ ...], ■ (3) 

~ 3m*"7i*fc*[l+f(«.— 1)(5 r— 7)1:+ ...], 

X4 = 4 «i*"w^(n — 1 )( 4 » — 6)P[1 + 0(A:)]. 

Hence c = »1:*[1 + J(n — 1)(3»— 6)1: + ...], 

74 = 3(n - 1 ) l:i[ I + ^(7 tc* - 38n + 43) fc + . . . ], 

72 = 4(?i- 1) {4n, - 5) 1:[1 + 0(1:)]. 

And when c is small o/ i , 

Tv / 

When R = 2, the expressions become very simple, 

Xi = m*(l+1:), 

Xg = 2m*k(2 + k), 

Xs = Bm^k^d+k), (5) 


Hence 


X, = (r - 1 ) 1 m*-(2kY~^ (r + 1;). j 

This is in accordance with Craig’s formula, putting ki = — k, p 1. 
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4. DlSTBlBtTTION OF THE PBODtJCT OF THREE INDEPENDENT 
NORMAL VARIATES 


Let X, y, z, be independent reduced normal variates, so that — I, 

** = 2/* = 2* = 3, etc. Let X—mi{l + k^x), Y=m^{l + ¥y), Z = 1113(1 + fc|2;). 
Here kl, kl are coefficients of variation of linear measurements, so that k^, k^, k^, 
are commonly about O-OOl. It is required to find the moments of the distribution 
ofXYZ. In the expansion of a power of JT F.Z we need only consider even powers 
of X, y and z. For example, 

= -^k^x^) {l + k^y^) {l + k^z^) 

~ m5m|m|(l + fcj) (1 + fcj) (1 + h). 

So if miWgmj = F (a volume), the moments of XYZ about zero are 


+ (1+^2) (1+^3)) 

/i 3 = ■F^(l + 3 fci)(l + 3^:2) (1 + 3^53), 

= F^(l + 6k^ + Ul) (1 + &k^ + Skl) (1 + 6^:3+ 3fc|). 

So Xk-j^k^~{- k-^k^k^), 

~ = GV^{Xk^k^-{- 4:kik2kf^, 

= 3F*(Ffc|+ 2Ekik2 + &Xklk2+ ^Bkj^k2k2+ 3Z'fc|fc| 

+ 3QSklk2k2+ l8Xklk2k2 + dklklkl 

Ki = QV*[2Xk\k2+ Xkfkl+ ikik2k2{4: + 4Xki + 2Xkj^k2 + kj^k2k2)]. 

In practice we can neglect all terms except the leading one, and write 

= F 

K 2 = F^lfci + iia+fca), 

IC 2 = QV^{k2k2 + k2ky^ + kj^k2), 

A4 = l2y^Qc\k2 + k\k2 + klk^ + k^k\ + k2kl + k2k\-\-8k^k2k2). 

If ki = k 2 = ka — k, we find 
Ai = F 

K 2 = F^A:(3 + 3A: + ¥), 

A3 = 6F3F(3 + 4jfc), 

A4 = 6F^A;®(28 + 61fc+24P + 4/i:®),, 
which may be compared with equations (1). Thus 

c = f{'ik + + ¥), 


( 6 ) 


(7) 


56c® 22g* 
"" ~9 ^ 


"j- . . .. 


(8) 
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It will be noticed that the leading terms of equations (2) and (8) are identical. 
Hence the two distributions will, in practice, be quite indistinguishable, even if 
tens of thousands of individuals are weighed. It follows that the cube root of the 
product of three independent variates is almost normally distributed. Further, 
the relations of and f o ^ are little altered when and kg are different, 
provided that they are of the same order of magnitude. Thus if = 0-001, 

11 

kg — 0-002, kg = 0-003, we find y, = — -c instead of 2c, and y* = — r— = 5-3c^ 

3 o 

instead of 6-2c^. Only a very large sample indeed would reveal departures of this 
order from the simpler expressions of equaitions (2) and (8). 


6. Distribution oi- the product oe n independent normal variates 

Let Xi, Xg, Xg, Xr, ..., x„, be n. independent reduced normal variates. The 
general case is of course somewhat complicated, so we shall only investigate the 
special case where all the coefficients of variation are equal. 

Then if X, = + k^x^), and M = the moments about zero of the 

distribution of the product UX^ are 

/I's = M^(l + 3k}^, 

M^(l + 6k + 3kY, 

ll'g = M5(l + 10fc+16A!2)«, 

/i' = 15A: + 46F+15P)". 

Hence 

/i2== Jf2[(i + fc)»_i], 

/tg = M3[(l + 3A:)" - 3(1 + + 2], 

= M\{\ + &k + 3A;2)« - 4(1 + 3A;f+ 6(1 + h)^ - 3], 

[jLg = if5[(l + 10fc+ IbPf - 5(1 + 6fc + 3F)™+ 10(1 + 3Af- 10(1 + *)”- 4], 

/tg = Jlif8[(l + 15fc+45A:2+15A:3)»- 6(1 + 10fc+16fcY+ 15(1 + 6fc + 3A:Y 

- 20(1 + 3A!)"-f 16(1 + if - 5], 

Ki = M^[(l -f 6fc+ 3F)«- 3(1 + 2fc + &*)«- 4(1 + 3fc)"+ 12(1 + A:)",- 6], 

Kg = lf8{(l + lOfc + 16h2)« + 5[ - (1 + 6fc+ - 2(1 + 4fc + 3 fc 2 )» 

+ 6(1 + 2* + A;®)'*' + 4(1 + 3A;)"- 12(1 + fc)»] + 24}, 

M^{{l + 15k + 4:5¥ + 1 5A!*)'>‘ + 1 6[2( l + 3k + 3k^ + F)« 

- ( 1 + 7A! + 9F + SP)" + 2( 1 + 6A: + 3&2)« + 8( 1 + 4A! + 3A:2)" 

- 18(1 + 2k + *2)™- 8(1 + 3i!)« + 24(1 + A:)«- 8] 

- 6(1 + lOA: + 15kY- 10(1 + 6A; + 9P)»}. 
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The higher cumulants can be evaluated as follows; 

i j (S*--! - 1 ) [(2 + ky - 4] k', etc. 

' Thus we find 

= M, 

K2 = MHk[l + U^-l)k + 

Ka = MH{n - 1 ) k\3 + 4(w - 2) i 4- . . 

K^ = l)P[4(4n — 6) + 3{13ft®— 49'a + 47)fc + ...]) 

/fg = m^n{n- 1) fc^[(6n- 6) (5u- 7) + 2(ft- 2) (143a- 345) [k 4- . ..], 

= 3if«a(» ~ 1) *®[432a'' - 1863a2 + 29i7a - 1768 + 0{/l;)]. 

These may be compared with equations (3). 

c = f{nk)[l + \{n~l)k+ ...], 

and when c is small ,, 

T.-5fc^[l+0OT], 

as in equations (4). 

When a == 2 we have the simple forms 
k-^ = M, 

= M^k{^ + k), 

.K^==mv, 

— 6Jlf^ifc®(4 + A), 


A-J, = (2r - 1 ) ! M<^k^-\2r + k), 

S’sr+i^ (2r + l)!M^+^k^. 

This is in accordance with Craig’s formula, putting k^ = k^ = k, p = 0. 

6. DiSTBIBUTION of the PBODUOT of two COERBIiATED NORMAL VARIATES 

In general the different linear dimensions of an organ or organism are posi- 
tively correlated. Organic correlations may reach very high values, such as 0-9, 
and presumably even higher values would be found for two approximately equal 
diameters of an approximate solid of revolution, such as an apple or an egg, where 
p may be taken as unity. On the other hand, quite low values are found. Thus the 
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length and breadth of a homogeneous group of like-sexed adult human skulls 
generally show a correlation of about 0*3 or 0*4. Hence the area of an organ will 
commonly be proportional to the product of two correlated variables, the 
volume to the product of three. 

Let X and y be two reduced normal variates, with correlation p. The cumulant- 
generating function for their joint distribution is + That is to say 

if r + 5 is odd, then = d. If r + s is even, then afy^ is the coefficient of — r in 

r!s! 

exp l{t^ + 2ptu+u^). That is to say, x'y^ is the coefficient of fM* in 




{t^ + 2piu+'a^)i<''+^\ 


multiplied by 


r\s] 


It follows that 


2 Kf+ 8 )[|(r+s)]!- 


X* *= 1, 

xy = p, 



X* = 3, 

x^y = 3p, 

3 . 2^2 = 1 .y 2 p*, 


II 

x®y == 15p, 

xy = 3(l-i-4p*), 

x®t/® = 3p(3-b2p*), 

X® = 106, 

x’y = 105p, 

x®y* — 16(1 -b6p*), 

sfiyH — I6p(3 + 4p*), 


= 3(3 + 24p®+ 8p*), 


= 106(l + 8p*), aj’y® = 316p(H-2/)^), a!®y* = 46(1 + 12p’* + 8p*), 

scy = 315(1 + 16p® + 16p*), 

Of course — 3f y^. 

Thus the moments of xy about its mean p are 

Pi=l+ p\ = 2p(3 +p2), = 3(3 + 14p® + 3p«). 

Hence — 6(1 + 6p2-)-p4). Hence the distribution is leptokurtic, and asym- 
metrical unless p vanishes. 

Now consider two correlated normal variates 

X — mi(l + ax), and Y = ln.2(l + &2/)> 

where a and 6 are coefficients of variation. Let A. Then the moments of 

Zy about zero are 

.p,[ = A(l+pab), 

/ta — ri.*[l a® -i 4po6 -f 6* -f (1 -1- 2p®) 0*6®], 

= ri®[l -)- 3(0* -f 3pab + 6*) -f 9o6{pa* -t ( I + 2p*) ab + pb^} + 3p(3 -f 2p*) o®6»], 

X == A*[l + 2{3a^ + 8pab + ^^) + ^pa*+npa^b + l2(l + 2p^)aW+lQpab^ + 3pb* 
+ 6o*6*{3( 1 + 4p*) o* -f 8p(3 -f 2p*) o6 -t 3( 1 + 4p*) 6*} -+■ 3(3 -b 24p» ■+ 8p*) 0*6®]. 
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Hence 

= /“a = A\a^+2pab + b^ + {l+p^)a%^], 

/fg = = 2A^ab[3{pa^+il+p^)ab+pb^}+p(B+p^)a%% 

== 3^3[((ja + 2^cf6 + 62)2 4. 2a262|(3 7^2) »2 + 2/3(7 + 3p2) ah 

+ (3 + 7p2) 62} + (3 + 14p2 + 3p4) ^4^4]^ 

/C 4 = 6 ^W[ 2 {(l + 3p2)a.2 + 2p(3+p2)a6 + (H-3yo2)6a} 

+ (l + 6p2 + p^)a®62], (10) 

in accordance with Craig’s formula. 

The most interesting case arises when a^h — lA. This case is important 
because in practice the coefficients of variation of different linear dimensions of 
the same organ are often nearly equal. Thus those of linear skull measurements 
in like-sexed adults in a racially homogeneous population are about 0'03, so that 
% is about 0-001. The moments and cuinulants of XT are then 


Xi== A{l+pk), 

Ki- Pz- A^h[2{\+p) + {\ + p'^)lc\, 

Kz^Pz^‘lA^h\^\+pf^{i+p^)k], 

Pi = 3A^jl:2j-4(i 4.p)2 + 4(3 ^ 7p4- 7p2 + 3p«) ^; + (3 + 14p2 + 3p«) 

Ki = 6^4P[4(l+p)3 + (H-6p2+p4)A]. (11) 

Hence 




while 


71 = |c[l + 0(c% 

72 = 3c2[1 + 0(c'=)], 

exactly as found when the two variates are in a constant ratio or quite indepen- 
dent. When however a =t= 6, this is no longer the case, for it is clear that the distribu- 

tion becomes normal when a or 6 vanishes. If = p, so that 73 > 1, we have, 


for equations (10), 
yi = 


1 + 2pp + p2 3c 
' {P + Pf ■ 


72 = 


2a6 

1 + Bpp - 3p^ +pp^ 

ip+pf 




both approximately. However, a and 6 may differ considerably without any very 
great effect of yjc or y^/c. Thus if o = 26, so that p = f , 

8(2-f.6p + 2p2)3c 
(5-|-4:p)2 2' 


That is to s^y, 7^ is 64 % of 3c/2 if p = 0, and 86 % if p = i, whilst 7a is 51 % of 
3c if p == 0, and 71 % if p = J. 

So far we have assumed that p is not negative, Ifp=- 1 , Xy== l—kx^, 
so the mean is J[(l - h), and the other cumulants are given hy 


K, = (~)'-2’-'i(r-l)!4’-jfcr. 
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= — 1 + 2 + + in equations (11) vanishes, though the distribution 

does not become quite symmetrical. However, negative values of p are of no 
biological interest. 

7. Distribution or the product or three correlated normal variates 

Let X, y, z, be three reduced normal variates as before, their correlations being 
Py„ = K, p^ = A, p^y = p. Thus the cumulant-generating function is 

+ u^ + v^ + 2 kuv + 2Xvt + 2ptu). 

Odd moments vanish, and the even moment xPy^z’’ is the coefficient of in 
(i* + u‘^ + v^ + 2 kuv + 2Xvt + 

multiplied by ' 

The required moments of products of two variates, such as 
xy = p, a;V = 3(1 + 4/*^). 

have already been given. The required moments of products of aU three are: 
x^z = K + 2kp, 

x^z = 3(A + 4Ap), x^y^z = Z[X + 2 kp^-2kp'^), x^y'^z'^ = \ + 2 Sk'‘‘+%kXp, 

xXi^z — 3(3a + 12A/i + 12/c/i® + 8A/i®), 

a;^ V — 3(14. 2 k2 + 4 A 2 + 4 /t^ + 1 QkXp + %X^p^), 

3?yH^ = 3(3^ + 6 aA + 2p? + ^K‘‘p + 6A*p + l2KXp^), 

x*y^^ = 3(3/c + 12 a® + 12A® + 24p® + 2&KXp + i^K^p^ + 48A®p® + Sp* + 64aA/*®), 

~ 9(3a+ 12Ap+ 2 a® + 12aA®+ 12 a/(® + 24A®A/t + 8A®/M 

+ SX^p + 8Ap® + 24 aA®p®), 

x^ = 9[3 + 24Z'a® + 8i:A* + 96Z'A®/i® + MkXp{3 + 2 Z'a® + 3aA/i)] . 

Other moments (except those such as xyH = A + 2kp, which are derivable by 
transposition) are not needed for our purposes. It follows that xyz has a sym- 
metrical and leptokurtic distribution, with mean zero, and 

Aj = Pa = 1 + SATa® + 8aAp, 

A4 = 12[2+172;A2 + 5Z'A* + 702:A®p®+4AAp(35+22rA®+32AAp)]. 

If X == mj_{l+ax), Y= m^(^l + by), Z = m^il + cz) the expressions for the 
higher moments are very complicated, though it can easily be shown that the 
mean is + icbc + Xca + pab), and the variance 

m|m|m|[ra® + 2XKbG + i:( 1 + a®) 6 ®c® + 2r(2A + 3Ap) a®6c 

+ ( 1 + 2i7A® + 8 aAp) «® 6 ®c®]. 

x6 
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We shall only give further consideration to the case where all the coefficients 
of variation are equal. And we shall confine ourselves to two special cases of this. 
In the one k = X = ^ ~ p, that is to say the variates are equally, and therefore of 
course positively, correlated. This is not very far from the case with the human 
skull. In the other, fc = 1, whilst A = /t = p. This is appropriate to an approximate 
solid of revolution, such as many eggs and fruits, or to a regular prism such as 
some sponge spicules. 

In the first case we have for the naoments of products of three variables: 

x^z = p{l + 2p), 

x^z = 3p( 1 + ip), x^y^z = 3p(l + 2p + 2p2), = 1 + hp® + 8p®, 

x^yh = 3p(34- 12p+ 12p2 + 8p®), a;VV ~ 3(1 -h lOp^H- 16p® + 8p*), 

iyia = 3^(3 6^ + i4pa 4. 12^8)^ 

xY^^ = 3(3 + 48p2 + 96p3 + 104p« + 6ip% 

xY^^ = 9p(3 + 12p + 26p2 + 40p® + 24p^), 

xY^* = 27(1 -t- 24p* + 64p® + 104p* + 128p® + 64p®). 

So if Z = ?Wi(l + l:ia:), 7=m2{l + kY> Z — m^il + kh), and mimjTOg == 1^ 
the volume which is the product of the means, then the moments of Z FZ about 
zero are 

pi = F(l + 3pl:), 

p' == F2[l + 3(l+4p)fc + 3(l + 4p+10p2)A:+(14-6pH8p3)/fc2], 

p' = F3[l + 9(H-3p)l: + 27(l + 6pH-8p2)jfca + 9(3 + 21p + 54p2 

+ 62p^) F + 27p(3 + 6p + 14p2 + 1 2p3) k% 
pi = F^[l + 6(3 + 8p)fc + 9(13 + 64p + 88p2)fc2+36(9H-64p+160p2 

+ 162p®) F+ 27(13 + 128p + 448p2 + 768p3+ 568p4) k* 

+ 54(3 + 24p + 144p2 + 304p3 4. 424^* + 256p6) k^ 

+ 27(1 + 24pa+ 64p3 + 104p^ + 128pB + 64pe) F]. 

Hence 

Ki~p'i = F(l + 3pjfc), 

= p,^ = F2A:[3(1 + 2p) + 3(1 + 4p + 7p2) i + (1 + ^ + 8p®) 

ATj = P3 = 6F3A2[3(1 + 2p)2 + (41 + 27p + 60p2 + SSp^) k 

+ 3p(4 + 9p + 8p^ + 14p8) 
p^ = 3F^P[9(1 + 2p)2 + 2(37 + 222p+ 471pa + 350p») k + 3(39 + 316p 
+ I038p2 + I584p3 + 100 lp«) + 8(3 + 24p + 127p2 + 268p® 

+ 346pi + 192p6) P + 9(1 + 24p2 + 64p3 + 104p4 + 128p® + 64p8) P], 

Ki = 6F*P[28(1 + 2p)8 + 3(17 + 144p + 46.8p* + 688p8 + 4Hp4) k 

+ 12(.2 + 17p + 92pa+ 193p3 + 241p* + 130pS) P 
+ 2(2 + 51p'^+ 140p8+ 225p*+264p5 + 128p«)P]. (12) 
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These expressions reduce to equations (1) if p = 1, and (7) if p = 0. When k 
is smaU, c2 = 3(1 + 2p) k, ^ 2o[l + 0(0®)], 

r2=-~[i+o(c^)]. 

Thus the relation of and to c is almost independent of the coefficient of 
correlation. 

We next consider the case when z = y,&o that x = 1, whilst X= [i = p. This 
is most simply solved by finding the moments of [\ + 'Bx){l + k^yf‘. Thus if 
X = m^{\ + k^x), and Y = + k^y), the moments of X about aero are 

pl = F[l + (l + 2p)i], 

p' = F2[l + (7 + 8p)*+3(3+8p + 4p2)F+3(l + 4:p2)P], 

Pa = F®[l + 18(l + p)fc+18(5+llp + 5p2)ii;2+30(5+15p+18p‘^ + 4p3)fc» 

+ 45(l + 6p + 6p2+8p3)i:*], 

p' = F<[l + 2{17+16p)&+3(127 + 266p+112p2)fcH84(21 + 64p + 64p2 
+ 16p3)P+ 105(31 + 128p+ 192pH 128pH IBp-^) 

+ 630(3 + 16p + 32p2+ 32p3+ 16p*) jfcH 316(1 + 16p2+ 16p^) ¥]. 

Hence 

/fj =p' = F[l + (l + 2p)H 

'^2 = ^2= F21b[6 + 4p + 4(2 + 6p + 2p*)fc+3(l + 4p2)fc2], 

2F3A:2[3(8 + 14p + Sp*) + 2(29 + 84p + 8^p^ + 16p®) k 

+ 9(2+ 14p + 13p2+ 16p») ifc*], 

p^ = 3F^P[(6 + 4pF + 8(40 + 1 13p + 95p2 + 22p8) + 2(427 
+ 1628p + 2376p2+ 1376p*+ 160p3+ 160p^) P 
+ 24(24 + 121p + 237pi‘ + 234p» + 104p*) fc» + 105(1 + 16p2 + 16p4) k^], 

X4 = 24F*F[30 + 80p + 66p2 + 14p3 + (96 + 364p + 613p2 + 292p® 

+ 32p«) A: + 3(22 + 1 1 6p + 227 p^ + 2 14p® + 96p^) k^ 

+ 3(4+67p2 + 64p*)A;®i. (13) 

Hence, approximately, 

c® = (6 + 4p) k, 

6(2+p) (4+6p)c 
(6 + 4p)* 

24(30 + 80p' + 65p2 + 14p3) c® 

'^2- (6 + 4p)3 

Hence yj varies between f|c, or l-92o when p = 0, and 2c when p = 1. This is 
its TnaxiTnum , so for high values of p, yjc is nearly constant. It vanishes when 
p = _0-8. ya increases from or 5-76c2, when p == 0, to or 8-2c^ when 

16-2 
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p = 1. This again is its maximum, so yj/c* is nearly constant for large p. does 
not vanish for any admissible values of p. In fact for positive values of p it would 
be impossible except with enormous samples, to distinguish this distribution 
from that of the cube of a normal variate. If p = 1, equations (13) reduce to 
equations (1). If p = 0, they give the cumulants of XY^, where X and Y are 
uncorrelated normal variates with the same coefficient of variation, namely, 

/c^ := F(1 + Ic), = V^k[5 + 8* + 3*2)^ 

/Cg = 4F2ifc2(12 + 2%h + 9^2). = 24F«/t2(3o + 95^, + 66*2 + 12^:3), 

which may easily be obtained independently. 


8. The prodtjct of n correlated normal variates 


If the variates are Xj^, Xj, X,., ..., X^, where X,. = m,,(l +a,.a;,.), and 
P = i7m„ while a:,, is a reduced normal variate, and p„ is the coefficient of corre- 
lation of and x^, and hence of X, and Xg, the general expression for the moments 
of Xi, Xa, . . . , X„ is complicated. It can however easily be seen that the mean is ; 

■^[1 "t 4- Y{PfgP^^^ -f priPsu + PruPls) ■!■•••]) 

while Pa = P\Sal+ 22’p„a,.ag-}- ...]. 


If every and every p^g = p, then vanishes when m = Soc^ 

is odd, and when m is even it is the coefficient of tf'tp ... in the expansion of 
{Xt'} + 2pXt^tg)^, multiplied by tx^\a2\oc^\l(2”''/nl). Thus the moments about 
zero are: 


pl = P 


So 


l + ln{n-l)pk+^n{n-l){n~2) (w - 3 ) p^k^ +... + ! + -■]’ 

P'2 = P*[l-l-‘a(H-2('ri-l)p}fc4-^'a('«.— 1) 

X {l-l-4(-n.— 2)p-l- 2(2n.2 — 6 th- &)p^}k^+ 

/(g = P3[l -I- 3?i{l •+■ |(?i - Ip} fc 4- ^n{n — 1) 

X {Iq-CSu— 4)p4-^(9'u2 — 21W4- 14) p^}k^y - ...], 


Pa = P^nk[l 4- (ra - 1) p -h i(Ti - 1 ) {1 4- 4(71 - 2) p 4- {3n^ - Qti 4- 7) p^} A: 4- . . 

Ps= 3P27i(Ti-l)[l4-(n-l)p2fc2[l4-0(fc)], (14) 

Thus c = [77{l4-(77,-l)p}*]i[l4-0(ifc)], 

T 4- (to- 1) P'1* 


71 


= 3(TO-1)[1±<^J 


[1 + 0(^)1 


(16) 


TO 


c2[l4-0(c2)], 


and presumably 72 etc. are approximately the same functions of c as in the case 
of equations (9). 
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9. The GaltoN'Maoalisteb distbibtttion 

Consider the distribution of the wth power of a normal variate when n ia very 
large, and k very small, but nU remains constant. All but a few terms of equa- 
tions (3) will vanish, and if nk'‘ is small, the distribution becomes identical with 
the distribution described by MacAMster (1879) whose first four moments have 
been given by Pearson (1905). Its oumulants can readily be found as follows: 
Given that a; = log A is normally distributed, to find the distribution of X. Let 
m and s be the mean and standard deviation of x. Then X = e®. Hence, since the 
moment-generating function of x is M(t) = the rth moment of X about 

zero is , — 

= erx ^ ^ M{r) = e.^r+w^^ 

where etc. are the moments of x about zero. Let M = j = Then 

fi'f ~ Hence the oumulants of X are 

/fj = M, 

/Cg = Af3(Z-l)2(Z-f-2), 

= M\l - 1 )3 (Z3 -f- 3^2 -f 6Z -f- 6), 

<6 = M^{1 - 1)^ (Z8 -f 4ZB -f lOZ^ •+- 2QV -t 30Z2 + 36Z -f 24), 

<6 == M8(Z - 1 )® (Zio -f 6Z» -f 1 6Z8 -f 36Z’ -f 70Z8 + 1 20Z5 -i- 1 80Z« 

+ 24:01^ + 2101^ + 24:01 + 120), (16) 

or, if Z ~ 1 is a small quantity, q, approximating to s, 

ATi = ilf, 

Ag = M% 

K^ = M^q\2 + q), 

K^ = M^q^{lQ+l5q+6q^ + q% 

Ag = ifV(126-t-222g-l-206g2-f 120g'*-|-46g«-l- I0q^+q% 

/Cg = iM'Y(1296-|-360Og-t-67OOg2-)-5166g»-|-4946g«-t2997g'5 

-t 1366g®-t- 465g’-t 106g8 -i- 15g2 + q^®). 

Hence c^ = q, 

= 3c -f- c®, 

Yg = 16 c 2 -M 6 c^-|- ..., 

Ya = 126c®-l-222c6-|-..., 

Y 4 = 1 296c* + 3660c® -t.... 

The first terms represent the limiting values of equations (3) and (9). The 
leading term of Yr f®’ 74 (*" + 2)’". Thus ?or a given coefficient of 

variation, Yi is 60 % greater than in the case of the cubed normal variate, Yg is 
167 % greater. 
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10, Biologioai, App]:j:oATioi>rs 


Wilson and Hilferty (1931) showed that the cube root of is almost normally 
distributed when n exceeds 2. Haldane (1938) gave the value of the cumulants 
in this case, and showed that, for large values of to, ( 1 3%® — to)’ ® is even more nearly 
normally distributed. It is interesting to compare the cumulants of the cubed- 
normal distribution [equations (2)] with those of the distribution. The latter 

/2 

are k-^ = to, Xg = 2 to, = Sto, = 48to, etc., so that c = -,71 = 2c, = 6c®. 

M 7h 

Thus if we compare the distribution with a cubed-normal distribution of the 
same coefficient of variation, we find that they are equally asymmetrical, but 
that the distribution has a which is fl-th that of the cubed normal. Hence 
the same transformation will nearly abolish both Xj and ic^ for both distributions. 

The success of Wilson and Hilferty’s transformation suggests strongly that 
we may use equations (1), (5) or (3) for the approximate normalization of moder- 
ately skew variates. This is an urgent problem in several applications of statistics 
to biology (Haldane, 1939). We evaluate a number whose mean and standard 
deviation in the case of random sampling are known. We desire to know whether 
it differs significantly from the mean. But the sampling distribution is found to 
be skew. If we can approximately normalize it, our tests of significance become 
far sharper. This problem is taken up in detail elsewhere. If yf > and both 

ate small, so that can be neglected, we can find m, to and k so that {m + k^x)'”' 
has a given y^ and By equations (3) and (4), 


TO = 14- 


4:71 

lQYl~9yf 


k^ 


7l 

9(to-1)2’ 



If, however, is not small, it is necessary to take several terms of equations (3), 
If in an observed distribution of weights or volumes, the estimate of y^/c, 
or of Xi/Cg/xl is approximately 2, it will be reasonable to try whether yjc^ or 
kIkJkI approximates to and if so to try to fit a normal distribution to the 
cube roots of the variate. It will be seen that this does not imply that all the 
objects considered are of the same shape. On the. contrary, such a distribution 
is to be expected whenever three mutually perpendicular measurements are 
normally distributed, provided that their coefficients of variation and correlation 
are not very different. And even the greatest differences in the latter, provided 
they are not negative, will have little effect. 

Unfortunately, reliable estimates of 7 i and y^ can only be obtained from 
samples of the order of 1000 or more. Rendel (in a paper to be published shortly) 
obtained the following estimates for the cumulants of the weight distribution of 
1202 viable duck eggs, corrected for grouping. The unit is a gram: 

fci = 73-294, k^ = 42-69, kg = 4-118-668, = 4- 937-21. 
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Hence the estimates of c, and y^ are 

c = 0-0887 ±0-0018, grj = +0-432 + 0-071, = +0-624 + 0-142. 

The standard errors are those appropriate to a normal distribution, but the true 
values cannot be very different. It is clear that and are significantly greater 
than the values of 0-27 and 0-13 which we should expect were the logarithms 
of weights normally distributed. They differ still more from what would be ex- 
pected were the cube roots of weights normally distributed, or on any other 
hypothesis leading to a similar distribution. 

Pearl (1906) lists the moments of the distributions of brain weights of eight 
European populations of 197 to 529 individuals. The various estimates of c are 
all close to 0-080, ranging from 0-074 to 0-083. Those of ji range from + 0-11 to 
+0-40, those of y 2 from -0-30 to +1-5, Tire weighted means are 

c = 0-07966 + 0-00106, = +0-2306 + 0-0461, gg = +0-2661 + 0-0922. 

If the cube roots are normally distributed, we should expect = +0-16, 
g '2 = +0-037; if their logarithms are normally distributed, we should expect 
g.^ = +0-24, = +0-10. The latter distribution gives the better fit, but the first 

is not impossible. 

Sinnott (1937) gives a graph of the distribution of the weights of squash fruits 
in an F^, which is positively skew. He shows that a graph of the distribution of 
their logarithms, though negatively skew, is more symmetrical. There is a sug- 
gestion that a graph of the distribution of their cube roots would be even more so. 
Unfortunately the actual figures are not given, and since curve-fitting by eye is 
notoriously uncertain, no more can be said. It is much to be desired that, when 
the full data are not given, estimates of the first four moments or cumulants 
should be published. 


11. Summary 

The first four moments of a number of powers and products of normal variates 
are calculated, with special reference to the probable distributions of weights, 
volumes, or areas of organs and organisms. In each case the first two measures 
(y^ and y 2 or and /ffg) of deviation from normality are obtained in terms of the 
coefficients of variation. The expressions obtained are almost independent of the 
correlation between the linear measurements, provided the coefficient of varia- 
tion of the latter are approximately equal. The distribution found is perhaps 
applicable to data on brain weights, hut not to data on ducks’ eggs. 
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THE TEANSFORMATION OF DATA FROM ENTOMOLOGICAL 
FIELD EXPERIMENTS SO THAT THE ANALYSIS 
OF VARIANCE BECOMES APPLICABLEf 

By GEOFFREY BEALL 

Dominion Entomological Laboratory, Ohatham, Ontario, Oanada 
1. Introduotoey 

The present paper deals -with experiments on the control of insects in the field. 
In such experimental work the problem to be investigated is whether more insects 
survive on plots which have been subjected to one treatment than on plots 
subjected to another. It will be shown in the present paper that the numbers of 
insects found per plot must vary in such a way that one cannot, strictly, subject 
the results to the analysis of variance, and it is proposed to find how the data 
may be transformed so that analysis of variance becomes applicable. Such 
transformation has been discussed by Bartlett (1936 a, 6) in connexion with 
entomological experiments, and by Tippett (1934) in connexion with industrial 
experiments. 

2. Exeerimehtal eesults considered 

The data used in the following work are results from seven insecticidal 
experiments arranged by the author at Chatham, Ontario. The work was carried 
out with replicated blocks containing plots subjected to treatments of which the 
assignment was random. This procedure, normal in agronomic work, was supple- 
mented by one repetition of each treatment within a block. The assignment of 
the repetition of a treatment was independent of the first for that treatment, 
except that, of course, the same plot could not be chosen twice. This repetition 
was carried out to obtain estimates of variabihty within blocks. In these experi- 
ments complete counts were not made but random sampling was employed. 
Experiments on Pyrausta nubilalis Hubn., reported by Beall et al. (1939), for 
which results are shown in Tables 1 and 2, were made on one area at two different 
periods, whereas experiments on Leptinotarsa decemlinmta Say, for which results 
are indicated in Tables 3 and 4, were carried out on contiguous areas at the same 
time. Three similar experiments were carried out in one place on the tobacco 
hornworm, PUegethontius qninquernaculata Haw., for which the data are shown 
in Tables 5-7. Reference is also made to the data from a uniformity trial on 
insects of Beall (1939), 

t Publication No. 2101, Division of Entomologyj Science Service, Department of Agriculture, 
Ottawa, Canada. 
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Table 1. Numbers of an insect, Pyrausta nubilalis, per plot. Experiment I 



Table 2. Numbers of an insect, Pyrausta nubUalis, per plot. Experiment II 
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Table 3. Numbers of an insect, Leptinotarsa deoemlineata, 
per plot. Experiment III 



Table 4. Numbers of an insect, Leptinotarsa decemlineata, 
per plot. Experiment I V 



Table 5. Numbers of an insect, Phlegethontius quinquemaculata, 
per plot. Experiment V 
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Table 6. Numbers of an insect, Phlegethontius quinquemaculata, 
per plot. Experiment VI 


Treatment 

Block 

1 

2 

3 

4 

5 

6 

1 

12 

13 


4 

11 

4 

1 

13 

9 


7 

5 

10 

i 

13 

6 

8 

1 

5 

7 

2 

20 


9 

4 

12 

7 

3 

7 


6 

4 

8 

g 

3 

7 

9 

4 

7 

3 

2 

4 

1 

1 

0 

1 

4 

3 

4 


2 

2 

1 

4 

5 



7 

12 

3 

6 

11 

6 

mm 

mm 

4 

1 

9 

8 

6 

19 


8 

9 

6 

6 

6 

m 

m 

2 

4 

4 

12 


Table 7. Numbers of an insect, Phlegethontius qiiinquemaculata, 
per plot. Experiment VII 


Treatment 

Block 

1 

2 

3 

4 

6 

6 

1 

10 

20 

14 

10 

17 

14 

1 

7 

14 

12 

23 

20 

13 

2 

11 

21 

16 

17 

19 

7 

2 

17 

11 

14 

17 

21 

13 

3 

0 

7 

3 

2 

3 

1 

3 

1 

2 

1 

1 

0 

4 

4 

3 

12 

4 

6 

S 

2 

4 

6 

6 

3 

6 

6 

4 

6 

3 

3 

3 

1 

3 

6 

5 

5 

6 

6 

1 

2 

4 

6 

11 

16 

15 

13 

26 

24 

6 

9 

22 

16 

10 

26 

13 


3. The eelatiohship between the standaed deviation and the mean 

IN THE BXPEEIMENTAL DATA 

If X is the number of insects on one of a group of small contiguous areas, say 
plots, within a larger area, say a block, let the expectation of x over all these 
plots be M and the standard deviation be cr; then over a number of the larger 
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areas, when the insects are distributed in a completely random fashion, from the 
Poisson distribution, = M. ( 1 ) 

As is discussed by ‘Student’ (1919) one cannot, however, anticipate that (1) will 
be satisfied when organisms occur in groups, as, say, when insects come from 
masses of eggs, or when there is a change in expectation from plot to plot within 
a block. Generally, cr* will tend to be greater than M and we can only say 

(r^=f{M). (2) 

The form of /(if), in (2), must be considered carefully, since it bears on the form 
of the transformation which may be developed to make the standard deviation 
independent of the mean. 

In dealing with (2), Bartlett (1936a) started by supposing that, approximately, 

0-2 = KM, (3) 

where K is a constant. Generally, in field data, however, the relationship between 
cr® and M, or of their respective estimates, and x, does not, as in Pig. 1, appear 
to be linear; rather, the departure of from x becomes disproportionately great 
as X increases. This relationship between departures and the magnitude of the 
mean has been discussed by Clapham (1936) in connexion with data on the 
distribution of organisms differing from insects as much as flowering plants, and 
he showed that only those distributions with very low mean have the squared 
standard deviation close to the mean. 

Our discussion above on the shortcomings of (3) suggests the conclusion that 

a-2-MocJf (4) 

is generally untrue. We propose to consider the possibility that the ourvilinearity 
of (2) might be better met by supposing that 

(5) 

Equation (5) leads to = JIf + kM^, (6) 

where kis& constant. It will be noticed that 

fc = {<r2-ilf)iff-2 (7) 

is the Charher coefficient of disturbance from a Poisson distribution. This 
coefficient was employed by Beall (1935). 

It is possible to consider the suitability of (3)^, as compared with (6), by finding 
how, respectively, they fit observations on and x. To fit exactly is difficult, and 
it was found necessary to fall back on an empirical determination of K and of k\ 
thus, if there are a number of pairs of estimates, x and s*, from (3) and (6) we 
estimate K = Ss^/Ex, (8) 

k = {Ss'^-Ex)lSx\ (9) 

where E represents the summation over all pairs. 
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Since in the work presented in § 2, S and s^, being based on only two obser- 
vations, are highly variable, these experiments do not show clearly the suitability 
of (3) and (6). Accordingly, reference is made instead to the data from the uni- 
formity trial on Leptinotarsa decemlineata Say of BeaU (1939). When the mean 
and standard deviation of 144 sampling units within each of 16 areas were con- 
sidered, the estimates from (8) and (9) were K = 2-405 and k = 0-2548. Bor 
these values from (1), (3) and (6), curves, described as lines 1, 2 and 3 respectively, 
are plotted in Fig. 1 , the observed values of mean and squared standard deviation 



Pig. 1. The squared standard deyiation plotted against the mean for 144 small areas within each 
of 16 large areas; line 1 is from equatiou (1), line 2 from (3) and line 3 from (6). The counts had 
been made on Leptinotarsa decemlineata Say. 

are also shown. In the cases where the mean is near unity the departure of the 
squared standard deviation from the mean, i.e. from line 1, appears to be trivial, 
but as the mean increases the departure becomes more marked. It can. be seen 
that the observations lie more snugly about Kne 3 from (6) than about line 2 
from (3). Generally, for the data from field studies the same effect has been 
observed. Such results suggest that (6) may be generally a better approximation 
to the form of /(iff) than (3) and make it preferable to proceed with the analysis 
of data from the assumption (6). 

4. The transformations of field data 

Fig. 1 shows clearly how, within an area, the variability of the numbers of 
insects on sub-areas is related to the mean number of insects per sub-area. This 
relationship will make invalid the use of the analysis of variance on experimental 
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results involving counts on insects, since the expectation of the variance 
should be the same for all plots. To overcome this invalidity, Bartlett (1936®) 
suggested transforming the observations, x, from the basis of (3) . The transforma- 
tion found was which Bartlett modified to Prom §3 it was seen, 

however, that for field data the relationship between standard deviation and 
mean may be represented better by equation (6) than by (3), and, since the form 
of the transformation depends on the form of f{M), a fresh transformation must 
be sought. A transformation, as is developed in the Appendix to the present paper, 
is suggested by the method of Tippett (1934), i.e. 

a:' = i-ismh~’-(fa)b (10) 

An advance note of this transformation was published by BeaU (1940). The 
adequacy of this transformation must be judged from the extent to which it 
stabilizes variability. In (10), if we express sinh-^ (■ha;)i, when lex < 1, as a well- 
known series, we have 

a:' = (11) 

where it is obvious that for k ~ Q, x' = a;l. Of course, for large values of kx, 
x', varies almost as log x, or as the log (a; -f 1 ) used by Williams (1937), and so our 
proposed expansion may be regarded, for practical purposes, as embracing the 
root and logarithmic transformations. 

Table 8 gives the transformation, (10), for a probable range of observations, 
X, and for k at intervals which will probably be close enough for practical purposes. 
This table was computed in part by inverse interpolation from the table of hyper- 
bolic functions of the Smithsonian Mathematical Tables (Becker & Van Orstrand, 
1931), and in part from (12). Should values of x' be required outside those of 
Table 8, these can conveniently be calculated from 

x' = k-^ logc {(i;a;)i -f ( 1 -f kx)^}, (12) 

In preparing Table 8 the question arose of whether, instead of deahng with 

sinh“’- (fa)i, one should not use fcAauib."'i(fci(a: + ^)*} in the same way as 
Bartlett (1936a) dealt with the transformation, (* + 1)1, instead of *1. This 
modification was rejected on the basis of results of the transformation, as dis- 
cussed in § 6, since it was found that the addition of ^ made little difference and 
did not give, consistently, an improvement. 

For field data, in making the transformation (10), it is necessary to estimate 
the value of k empirically by (9) for which estimates *, of the mean and s, of the 
standard deviation, must be found. The most obvious method in practice of making 
these estimates seems to be to put more than one plot subjected to a given treat- 
ment in a block and so to estimate the chance variation of results for a plot within 
a block. In the present work, as is discussed in § 2, two plots were subjected to a 
given treatment in each block and this is probably good practice. 



Table 8. The transformaMon a;' = h-i siiih~^ (kx)^ 
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262 Transformation of data from entomological field experiments 

In the special case where there are two plots for the ith treatment {i — 1 , ...,n) 
in the jth block ( j = 1 , . . . , i\^). and so two observations, and the estimate 
of the mean wUl be written a:y_ and of the squared standard deviation 

= (13) 

Then from (9), we estimate k from 

A; = 2! s S S S ( s S (®«i+*«2)4 (14) 

U=iy=r t=ii=i I U=ij=i / 

and the calculation is very light. 

5. Results showing the eefeot of the teansfobmation 

ON THE VABIABILITY OF DATA 

The adequacy of our proposed transformation may be judged in two ways: 
first, with respect to its effect, which we shall consider in the present section, on 
the differences between repetitions of a treatment within a block, and secondly, 
with respect to its effect, wliich we shall consider in § 6, on the behaviour of the 
quantities submitted to the analysis of variance. 

It is a fundamental assumption in the analysis of variance that the chance 
variability for each plot shall be, when the effect of block and of treatment are 
removed, normally distributed with a standard deviation common to all plots, 
in which situation of course the standard deviation of the chance variability for 
a given plot is independent of the expectation for that plot. In the data of the 
present work, where each treatment is repeated in each block, it is possible to 
examine the estimates of this standard deviation, Sy, and of the expectation, ajy . 
For a clear graphical illustration of the situation consider Fig. 2, as obtained from 
the original data of Experiment III on LepUnotarsa decemlineata, where Sy is 
plotted against Xy,, and contrast this situation with that obtaining for the 
corresponding quantities 5y and a;y , obtained after transformation (k = 0-08) 
in Fig. 3. 

In Fig. 2 the points are widely scattered as is natural from a sample of two; 
nevertheless, it is apparent that for the smallest values of a:y. the values of s^j 
are correspondingly small and faU in a close group. In Fig. 3 the cluster of obser- 
vations in the lower left-hand comer of the previous diagram has disappeared, 
and generally the scatter appears to be independent of a:y , so that apparently 
the transformation gave.satisfactory results. The nature of the material involved 
is such that it does not seem possible to examine the relationship under con- 
sideration more exactly, nor to summarize exactly the corresponding results for 
the other treatments; it can only be said that the same type of result appeared 
although the magnitude of the relationship before transformation depended on 
the magnitude of the differences between the effects of treatments. 

The results shown in Figs. 2 and 3 suggest that the proposed transformation 
has tended to make the standard deviation independent of the mean, in accordance 
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with, the assumptions underlying the analysis of variance. In using this procedure 
one actually assumes, more broadly, that a common standard deviation exists, so 
that the homoscedasticity of observations before and after transformation should 
he tested. Thus it is assumed that and are observations from a normal 



0 100 200 300 400 500 



Fig. 2. The standard deviation and moan as estimated from plots by pairs, 
with untransformed data on LeplirwUirsa decemlineata. 



Fig. 3. The standard deviation and mean as estimated from plots by pairs, with the transformed 
data on Leptinotarsa decemlineala Say, i.e. using x' = k-i sinh“^ (te)*. (A: = 0-08). 


population with a 


standard deviation, cr, which is independent of i and j. Then 
is distributed as wifb one degree of freedom.-|- Accordingly, 


f In the Zij test, discussed by Nayer (1936), this case of estimates of standard deviation with 
one degree of freedom is troublesome since zero values tend to arise when dealing with grouped or 
integral observations. When this is the case Lj , which is the ratio of an arithmetic to a geometric 
mean of sums of squares, cannot be calculated. The present treatment may therefore have a wider 
application. 
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Vn = (%-%2)/V2(r should be distributed normally with unit standard devia- 
tion for all i andy In order to test the hypothesis of normality with unit standard 
deviation it is only necessary to test for leptokurtosis; for the distribution must 
be symmetrical since the sign of differences, and therefore of y^, is a matter of 
chance. Since the number of items involved will almost certainly be < 100, and 
since the population mean is zero, the criterion of Geary (1935) will provide an 
appropriate test. In using this criterion we must find the ratio of the mean 
deviation to the standard deviation, i.e. 


I n -V 

= S S I %1 " 1 

U=U=1 


n N 


i=i;=l 


(16) 


Of course, values of may be calculated for transformed data by substitution 
of^fcforajyj,, 


Table 9. The test on the homoscedasticity of counts by plots 
within a block for six field experiments 





Lower 

fi% 

limit 



Value of k 

Transfonned 

(Beall) 

“n 

Departure 
by S.D. 


Departure 
by S.D. 

Eeti- 

mated 

Em- 

ployed 


Departure 

by 3.D. 

I 

70 

— 

0-341 

0'737 

0-6669 

-6-33 

0-7626 

-1-91 


mi 

0-7846 

-0-64 

ii 

70 

0841 

0'767 

0-7886 

-0-49 

0-7807 

-0-79 


^BB 

0-7499 


in 

28 


0'737 

0-6973 

-6-33 

0-6431 

-4-16 

B K B 

^BB 

0-6838 

-3-11 

IV 

24 

■is: 'I !■ 

0-732 

0-6692 

-8-67 

0-6948 

-2-87 

B ^ B 

fl 

0-7370 

-1-68 

V 

18 

■is: : 1 

0-728 

0-7654 

-1-16 

0-8166 

+0-13 

B S B 

Bfifl 

0-7823 

-0-68 

VI 

36 

IjljH 

0-746 

0-7969 

-0-22 

0-8166 

+ 0-38 

Bl 

B 

0-8130 

+0-28 


Values of from the untransformed observations and from the transformed 
observations, both following Bartlett (i.e. the transformation (k-I-|)^) and 
following the line suggested in the present paper, are shown in Table 9 for the 
field data of Tables 1-6. For the second transformation the values of k as calculated 
from (14) are shown as well as the nearest value of k entered in Table 8. For 
each experiment the value of nN and also the 0-05 limits of probability, from 
Geary (1935), are shown. There are also shown the departures of observed 
from the expected value in terms of the standard deviation, a useful criterion 
since the distribution of is almost normal. From Table 9 it can be seen that 
out of the three experiments in which feU beyond the lower 5 % limi t of 
probabihty for the untransformed data and the data transformed as (a:+i)*, 
in only one experiment did fall so with the final transformation. The results 
for Experiment II, in which is decreased by the transformation, are peculiar, 
Consideration of the departures from the mean in terms of the standard devia- 
tion indicates more clearly the improvement effected by each transformation 
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and how the transformation suggested in the present work secures an improve- 
ment of the same, but more marked, character than that secured from the 
transformation of Bartlett. The results suggest that while homoscedasticity may 
not be attained always, it will be approached by means of the proposed trans- 
formation. 

6. The befeot of the teansfoemation on the analysis 

OF VAEIANOE 

As was indicated at the beginning of § 5, our proposed transformation besides 
making the variability within a block for a repeated treatment the same for all 
treatments and blocks, should also provide quantities satisfying the assump- 
tions underlying the analysis of variance. Since it is not quite clear how, in so far 
as the transformation is satisfactory in the first way, it will necessarily be satis- 
factory in the second, it will be well to consider directly the suitability of our 
transformed values for the analysis of variance. 

In the application of the analysis of variance one would deal with rather 
than with and suppose that 

— A-{- + Dij, (16) 

where A is a contribution from the general level of population on the experi- 
mental area, the contribution of the ith treatment and Gj the contribution of 
the jth block. The remainder term, D^j, is called the interaction of treatments and 
blocks. Of course, the present discussion on the untransformed values, 
holds for the transformed values, + iSya) when the appropriate 

symbols, A ', O') and are used. 

In material satisfying the conditions underlying the analysis of variance, for 
the observations under each treatment, the calculated squared standard devia- 
tion is 1 ^ 

= s, (17) 

ToUowing the argument of the analysis of variance, of which the mean 

is 0, is an estimate of Gj + D^j, in which the two terms are independent ; hence the 
expectation of s? is o-J = (18) 

where cr^j and are the standard deviations of the parameters, Cj and B^j, 
respectively, and are independent of treatment. Accordingly, Sj should be in- 
dependent of treatment and distributed as an estimate of cr^, having JV — 1 degrees 
of freedom. Conversely, if a;y_ cannot be built of the independent terms of (16), 
then the various values of will not be distributed as estimates of a single 
standard deviation. The hypothesis that the values of Sj in any one experiment 
are estimates of one quantity may be tested."! 

t From correspondence with Dr R. W. B. Jackson, the writer has learned that he had arrived 
independently at the same test. 
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The results of the tests on the homogeneity of the values, within the six 
experiments treated in the present paper are presented in Table 10, where the 
value of the Lj criterion is shown for the original data and for the transformed 
values together with the appropriate 0-06 and 0-01 levels of probability (Nayer, 
1936). From Table 10 it can be seen that of the values of obtained from the 
original data, all but one are near or beyond the 0-06 level of significance, but 
that after transformation all are moved in to less significant values. Accordingly , 
the values of when calculated from the original data, appear heterogeneous but 
the corresponding values obtained after the transformation appear homogeneous. 
Thus it is more probable that the analysis of variance is applicable to the trans- 
formed data than to the untransformed. 


Table 10. The homogeneity, as measured by the criterion L^, of the estimates 
Si for various values of i before and after transformation 



Experiment 


% 

3 

B 


6 



Li before trauafomation 

0-861 

0-833 

0-325 

0-651 

0-344 

0-680 

Li after transformation 

0-804 

0-941 

0-706 

0-688 

0-813 

0-730 

1 % limit 

0-167 

0-767 

0-604 

0-642 

0-614 

0‘683 

5% limit 

0-813 

0-812 

0-707 

0-658 

0-648 

0-673 


In Table 10 we have tested the homogeneity of the estimates, s^, as in § 5 we 
tested the homogeneity of a^p that is without reference to the values of the 
associated means. In view of our original assumptions we are, however, interested 
in the possibility that the standard deviations, as calculated, might show every 
sign of being estimates of a common standard deviation and yet be dependent 
on the associated means. Accordingly, we have investigated such dependence 
roughly by fitting by least squares a first order regression of on . From this 
fitting we record the sign of the regression as follows: 



Experiment 


B 

2 

3 

4 

6 

6 

Before tranaforraation 

■f* 




+ 

+ 

After transformation 

- 



B 

+ 

+ 
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By a single asterisk we have indicated oases where the reduction in variability 
effected by the regression passed the 5 % probability limit and by a double 
asterisk where it passed the. 1 % limit. Several points may be noted. (1) In two 
cases (Experiments 2 and 6) after transformation the residual sum of squares 
about the regression was greater than the reduction in squares due to the regres- 
sion, whereas it was consistently less before transformation. (2) As can be seen 
above, the regression generally did not effect a significant reduction in variability 
after transformation but did before (the small number of degrees of freedom made 
high significance difficult of attainment). (3) After transformation the sign of 
the regression seemed to be a chance matter, whereas before transformation it 
was consistently positive. These results suggest that the transformation pro- 
posed did tend to make the variability within a given treatment independent of 
the mean for that treatment. 

7. The EEEECT OE TRANSEOBMATION upon' the OONOLtlSIOXS 
FROM THE ANALYSIS OF VARIANCE 

It has been shown in §§ 6 and 6 that the analysis of variance can be made on 
entomological data when a suitable transformation has been effected. It is of 
practical interest to see what numerical effect such transformation will have upon 
tests on the significance of, say, the effect of treatment and the significance of 
differences for treatments. 

Eirst, consider the numerical results to be obtained from the analysis of 
variance (1) without and (2) with transformation. Thus the mean square asorib- 
able to blocks, treatments and their interaction is shown in Table 11, for six 
experiments of which the data are given in §2; parallel results are presented for 
untransformed observations and for observations transformed by (10) with the 
-values of h from Table 9. To facilitate the comparison of the results, the mean 
square for blocks and for treatments is expressed in terms of the estimate for 
interaction, as the F of Snedecor (1934), and presented in each case. The trans- 
formation of the data has modified the conclusions to be drawn from the analysis 
of variance in Table 11, in that there are considerable changes in the criterion, F, 
for treatments or for blocks. In the examples shown the effect of treatments was 
highly significant in all cases and so the changes introduced by transformation 
did not alter the conclusions, as would have been the case for less definite effects. 

Consider next the effect of transformation on the significance of differences 
between the means for treatments as tested by the criterion, {, calculated with 
such estimates of mean square as the interaction of Table 11, For illustration, 
values of t, from the data on Leptinotarsa decemlineata (Experiment III), are 
shown in Table 12 for each possible comparison of treaments when untransformed 
data are used, when the transformation, as suggested by Bartlett (1936a) 

is used, and when the transformation, ^:“lsmh~^ {kx)^, as suggested in the present 
paper is used. In order that the influence of the level of population under each 



Table U. The analysis of variance of untransformed and 
transformed data in six experiments 


Variation 

Degrees 

of 

freedom 

Untransformed data 

Transformed data 

Mean 

square 

F 

Mean 

square 

F 


Experiment I. P. nvbilalis 



Between blocks 

» 

92-8 

1-21 

0-666 

1-66 

Between treatments 

6 

2,839-0 

36-96 

7-61 

22-03 

Interaction 

64 

76-8 

— 

0-341 

— 


Experiment II. P. ntibilalis 



Between blocks 

9 

677-0 

6-72 

6-37 

10-65 

Between treatments 

6 

1,721-0 

20-04 

8-69 

17-07 

Interaction 

64 

86-9 

— 

0-609 

— 


Experiment III. L. decemlineata 



Between blocks 

6 

20,172-0 

2-26 

4-67 

3-03 

Between treatments 

3 

390,932.-0 

43-77 

111-5 

72-16 

Interaction 

18 

8,931-0 

— 

1-64 

— 


Experiment IV. L, decemlineata 



Between blooka 

5 

12,960-0 

2-06 

2-06 

1-96 

Between treatments 

3 

124,064-0 

30-40 

20-40 

19-41 

Interaction 

16 

6,727-0 

— 

1-05 

— 


Experiment V. P. guinguemaculnto 



Between blocks 

6 

27-4 

2-74 

0-349 

4-19 

Between treatments 

2 

762-0 

76-83 

19-8 

233-77 

Interaotion 

10 

9-98 

— : 

0-083 

— 

Experiment VI. P. quinquermculaia 



Between blocks 

5 

41-7 

4-13 

1-50 

4-12 

Between treatments 

6 

66-2 

6-66 

3-33 

9-14 

Interaction 

25 

10-1 

• 

0-366 



Table 12. The values of t, in the comparison of means, as calculated from the 
untranaformed and the transformed data of Experiment III on Leptinotarsa 
decemlineata 


Comparison 

Means for 
untransformed 
data 

— 

t without 
transformation 

f from 

— 

t from 

3inli~^ {kx)^ 


362 

30 

+ 9-27** 

■i-11-33** 

+ 9-92** 


302 

224 

+ 3-84** 

+ 3-621'* 

+ 2-11* 


362 

11 

+9-82** 

+ 12-89** 

-f 12-46** 

* 2 ., -* 3 .. 

30 

224 

-6-43** 

- 7-81** . 

- 7-81** 

® 2 ..“* 1 .. 

30 

11 


+ 1-66 

+ 2-64* 


224 

11 

4- e-os’** 

+ 9-37** 

+ 10-36** 
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treatment may be judged, there are shown, also in Table 12, the means for the 
untransformed data. The values of t falling beyond the 0-01 level of significance 
have been marked with two asterisks and the values beyond the 0-05 level with 
one. It can be seen that the transformation resulted in a profound alteration in 
the conclusions. Apparently oh account of the dependence of variance on mean 
in untransformed data, the pooled estimate of variance was originally too low 
for the treatments which resulted in high populations and too high for the treat- 
ments which resulted in low populations. Thus, in the comparison of the first 
and third treatments, which appeared to have the two highest surviving popula- 
tions, the value of t calculated from untransformed values was high. In the other 
extreme case, the comparison between the second and fourth treatments, the 
value of t, as calculated from untransformed values was very low. It can be seen 
further, that the first transformation only secured in part the modification in 
the value of t that was secured by the second transformation. 

8. The PBOCEnuRB OE transeobmation in rbactioe 

The methods which were found applicable in the preceding discussion will 
now be illustrated in the transformation of the data shown in Table 7 (Experknent 
VII) on Phlegethontiiis qiiinquermc%data, of the same type as the experiments 
previously discussed in the present paper. The steps in the analysis wfil be set out 
with the purpose of providing a model for procedure in estimating the constant, 
Ic, which will be used to effect a transformation, of the data so that the analysis 
of variance may be made. 

Supposing that the experiment has been laid out with a repetition of each 
treatment in each block, the procedure of estimating k makes it first necessary to 
find the sum and the absolute difference of each pair of plots subjected to a given 

n N 

treatment in a given plot andthen to sum the sums, S S (%i + ^m)> the sums 

n W n N 

squared, S S(%l+% 2 )^ also the differences squared, S 

i=l^=l {=1^=1 

over aU such pairs and by substituting the results in (14) to find k. In the case 
being used for an illustration the two plots subjected to the first treatment in 
each block gave respectively 10 and 7, 20 and 14, 14 and 12, 10 and 23, 17 and 20, 
14 and 13, so that 

S(%i + %2)= (10-f-7)-t-(20-l-14)-t(14+12)+... = 174. 

Similarly, 

S(a!„i + a:y2)® = (10 + 7)2-t-(20H-14)2H- ... = 6308 
i=i 

and similarly, 

(10-7)' + (20-14)2+... = 228. 

1=1 
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Of eoursej in estimating k the summations are not limited to one treatment but 
must be extended over all in the experiments. If this is done we find 

i S S S = 19.656, % - 708. 

From (14) we estimate Ic = gg^ — ^ “ 0-002, 

and referring to Table 8, p. 260, use k = 0-00 as the nearest value occurring there. 
Of course, in this case, the transformation is simply xK 

Now from the above result it will be possible to replace the observed values of 
Table 7 with the corresponding transformed values from the first column of 
Table 8. Thus in Table 7 replace in the first row; 10, 20, 14, 10, 17 and 14, by 
3-16, 4-47, 3-74, 3-16, 4-12 and 3-74. With such transformed values we can now 
proceed to carry out a routine analysis of variance which will be facilitated by 
working with the sum for each pair of plots in a given block with a given treatment. 
For example, the final analysis of variance for Experiment VII would be carried 
out with the values of Table 13. 


Table 13. Transformed and summed values to be used in the analysis of 
variance for Experiment VII on P. quinquemaculata 


Treatment 

Block 

1 

2 

m 

4 

5 

6 

1 

6-81 

8-21 

7-20 

7-90 

8-69 

7-36 

2 

7-44 

7-90 

7-74 

8-24 

8-94 


3 

l-OO 


2-73 

2-41 

1-73 


4 

3-97 

5-91 

3-73 

4-48 

4-48 

3-41 

6 

3-97 

3-97 

4-18 

2-00 

3-14 

4-46 

6 

0-32 

8-60 

7-87 

0-77 

10-20 

8-61 


9. Summary and conclusions 

The foregoing work is a study of experimental results from seven field experi- 
ments on the control of insects. In such data, the standard deviation of the 
number of insects per plot varies with the mean. By the transformation, 
x' ~ sinh"^ where is a constant and x an observation, the data were 
put in a form for which the standard deviation approached a constant independent 
of the mean. The estimation of the one constant, k, necessary for the transforma- 
tion was made possible by the design of the experiments with repetition of treat- 
ments within blocks. In practice, the transformation gave good results so that 
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analysis of variance could be made. From the analysis of the transformed data, 
the results were found to differ markedly from those which would have been 
obtained from the untransformed data. 
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APPENDIX 

As has been said, the transformation of (10) was suggested by the method used 
by Tippett (1934, p. 61). The procedure is as follows. 

It is required to find x' such that the standard deviation, cr^,/ of 

shall be approximately constant. Let us write 

X' + (19) 

where M is the expectation of x and whence, approximately, 

(x'~M')=f'(M)(x~M), (20) 

where M' is the expectation of x'. Hence 

<7|. = {/'(lf)}V2, (21) 

where <t is the standard deviation of the observations, x. Replacing. cr^- in (21) 
by a constant, c, as is the purpose of our operation, and substituting for cr from 
equation (6), p. 247, we have 

f'{M)^c{M+kM^yi, ( 22 ) 

where k is, as has been previously discussed, a constant peculiar to our data. 
Integrating in (22), 

/(Jf) = 2ch-*sinh-i(i:Jlf)i. (23) 

From (23) the form of the function suggested is sinh-^ {kx)^, but it is wise instead 
to use sinh~i {kx)^, since the transformation then becomes identical, as shown 
in (11), with the established transformation, a:*, when k = 0. 

As Tippett (1934) says: ‘This derivation is not mathematically sound, and 
the result is only justified if on application it is found to be satisfactory.’ The 
writer would have hesitated to have used it had it not already led to useful 
transformations in oases analogous to the present, namely to where x comes 
from a Poisson distribution, to 8in~^p* where p comes from a binomial distribu- 
tion and, according to Tippett^ to tanh“^r, where r is the correlation coefficient. 



262 Transformation of data from entomological field experiments 


REFERENCES 

Bahtlett, M. S. (1936a). Square root transformation in analysis of variance. Roy. 
Statist. Soe. Suppl. 3, 68-78. 

(19366). Some notes on insecticide testa in the laboratory and in the field. J. Roy. 

Statist. Soc, Suppl. 3, 186-94. 

BjsAix, G. (1936). Study of arthropod populations by the method of sweeping. Ecology, 
16, 216-26. 

(1939). Methods of estimating the population of insects in a field. Biometrika, 

30, 422-39. 

(1940). The transformation of data from entomological field experiments. Canadian 

Ent. 72, 168. 

BEAin, G., Stibbett, G. M. & Connebs, I. L. (1939). A field experiment on the control 
of the European corn borer, Pyrawia nnbilcdia Hubn., by Becmveria Baasiana Vuill. II. 
Scii. Agric. 19, 631-4. 

Bbckbb, G. F. & Van Obstrand, C. E. (1931). Hyperbolic fimctions (4th reprint). Smith- 
sonian Mathematical Tables. 

CI/APHAM, a. R. ( 1936). Over-dispersion in grassland communities and the use of statistical 
methods in plant ecology. J . Eeol. 24, 232-51. 

Geaby, R. 0. (1935). The ratio of the mean deviation to the standard deviation as a test 
of normality. Biometriha, 27, 310-32. 

Nayeb, P. P. N. (1936). An investigation into the application of Nejrman and Pearson’s 
ij test, with tables of percentage limits. Statist. Res. Mem- 1, 38-51. 

Snedkcob, G, W. (1934). Calculation and Interpretation of Analysis of Vas'iance and 
Covariance. Pp. 96. Ames, Iowa: Collegiate Press Inc. 

‘Student’ (1919). An explanation of deviations from Poisson’s law in practice. Biometriha, 
12, 211-16. 

TrPBETT, L. H. C. (1934). Statistical methods in textile research. Part 2. Uses of the 
binomial and Poisson distributions. Shirley Inst. Mem. 13, 36-72. 

Williams, C. B. (1937). The use of logarithms in the interpretation of certain entomological 
problems. Ann. Appl. Biol. 24, 404-14. 



INTERPOLATION EOR FRESH PROBABILITY LEVELS 
BETWEEN THE STANDARD TABLE LEVELS 
OF A FUNCTION 

By J. B. SIMAIKA 
1. The peoblem 

A NTJMBBB of tables of probability functions exist, and more will no doubt before 
long be available, giving values for a variable x corresponding to a limited number 
of simple probability levels a. How far is it possible to. obtain x rapidly for inter- 
mediate values of a1 

The variable may be put into standardized form as the ratio of the deviation 
from the mean to the standard deviation; these two quantities (i.e. the mean and 
standard deviation) are often easy to obtain, whereas the probability integral 
riay require extensive computation. Denote by the standardized variable 
so that 

a;„— meana; 

“ ” standard deviation of*’ ' ^ 

The question we shall consider is this: having full and accurate tables relating 
% and a for a standardized normal variable, denoted by U^, can we use these 
values as auxiliary in obtaining u„ for any other function tabled only at a few 
probability levels and, having found an interpolating formula, what is its accuracy t 
In examining this point we shall compare the accuracy of the method with that of 
some other methods of deriving intermediate values of u^. 


2, GeHEBAL APPEOAOH 


It win be useful to consider first how far a general theoretical approach will 
take us. Let the variable x follow a probability law defined only by its cumulants 
Kf{r = 1,2, ...); then the first four cumulants of the variable w become 0, 1 , 71 , 72 . 
It is known that the relation between and a may be written symbolically 






exp 




dx^ dx^ ' 


er^^^dx, 


while the same relation for a normal variable is 


( 2 ) 


a = 





( 3 ) 


Using equations (2) and (3) it has been shown by Cornish & Fisher (1937) 
that can be approximated to by a parabolic curve in and vice versa. If we 
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assume that for r ^ 6 is negligible, this expression, using a fifth degree parabola, 
is u,^A + BU,+ CUl+DUl + mt + FUl (4) 

where A ?= -i 7 i--^ 7 iya+'^'yi, 

-B = 1 - in + totI “ iMriTa - Hrir!. 

0= iri+^rira-^rf. 

-D= ^ra-^r! +W')'a — 1117172+ fH'l'n 

•®= -^7i7a+^7!. 

■f = ~iks7l +Tl37i7s~’m7i- 

Now as 7 i and tend to zero, B tends to unity and all other coefficients tend 
to zero, i.e. the curve of w„ as a function of Z7„ tends to the diagonal line 

Figs. 1-4 give these curves for different numbers of degrees of freedom for the 
commonly used statistical variables x^, X’ * ^ transformation of z referred 

to below), expressed in standardized form. 

With regard to the coefficients in (4) it may be remarked that large values of 
72 do not increase them as much as large values of Furthermore, when y-i is 
zero, the coefficients C and E vanish and the expression (4) becomes 

u^^BU^+DUl + FUl 

or p^B + DUl+F{Ul}K (6) 

These broad results suggest that a good method of interpolation, when both Jx 
and 72 exist, is a Lagrange formula through the points 

(* = 1,2,...). 

Remembering that ~ (a;„.-meana:)/(s.r). of x), from the practical point of 
view the interpolation can be carried out more expeditiously and without loss 
of accuracy by using a Lagrangian formula through 

When, however, is zero as in the case of the i-distribution it would be better to 
take the Lagrange formula through the points 

Kij or alternatively through {i = 1 , 2, ...), 

the mean x being zero. 

The accuracy of the method is likely to depend on the value of and 72 . 
For example, as seen in Fig. 1 , linear interpolation between, say, f 7 o.o 5 and ? 7 o.o 2 
will be more accurate with r = 18 than r = 3, the y’s being smaller in the former 
case. Again, the curves will be more nearly linear if we take as variable 

u = (x - mean x)/a'^ rather than w = - mean x®) /(r^‘ 

because the y’s for the former are the smaller. 
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Por the practical worker it will often be sufficient to use linear mterpolation, 
i.e. to make use of two tabled probability levels only. For more accurate work 



Standardized normal variate, i7 
Fig, 1. Relation between and U, 

three or more levels can be used, but an increase beyond this is not, in fact likely 
to lead to a gain in accuracy which will be worth the labour. 
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Interpolation for probability levels 

3. CoMPABISON OP METHODS 
To compare the accuracy of the interpolation based on the polynomial 
expansion (4) with that obtained by other possible methods, we have considered 
the t and v (beta)-probability ^stributions. For each, different methods of 
interpolation have been devised. In some a transformation of the variable has 



been used, while in others a transformation of the argument or a completely new 
argument has been considered. 

The accuracy of each method in the range covered by the values a <0-10 
and a 0'90 has been tested in the following way: between any two consecutive 
probability levels a number — not less than three — of intermediate values have 
been interpolated. These values were chosen to be those which could be obtained 
accurately from some other table. The greatest deviations in each interval are 
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given in Tables 1—3. The methods have been arranged according to the degree 
of accuracy obtained. 



It remains to point out that, if the n tabulated probability levels are denoted by 

<<Xa< ... O'lO, 

and if the interpolation is to be carried by using a quadratic or higher expression 
in the argument, it has been found that the interpolated value is always more 

Biometrika xxxii i8 




268 Inhrpolation for ’probability levels 

accurate when the probability levels used include as many as possible of the 
probability levels below Similarly, if 

0-90<ai <a2< ... <a„, 

it is better to use probability levels including as many as possible of those above 


-UO -1-5 -2-0 -2-5 



StendatdUed normal variate, V 
Kg, i. Relation between u{v) and U, where p(^v) = (1 — 


4. The x®-PROBABiLiTy eunotiok 
The standardized form of used in Kg. 1 is 


u{x^) = 


V(2v)‘ 


( 6 ) 


V being the number of degrees of freedom. The yi and yg of this distribution are 
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^{8lv) and 12/v respectively. If we consider % itself instead of we find that its 
standardized form has the approximate value, used in Fig. 2, 


MX) = 


xV2-V(2t>-l) 


(7) 


and its and are (2u)“i + 0(v~®) and 0(v~^) respectively. Both these last 
quantities are smaller than those for x®, which suggests that interpolation will 
be more accurate using x rather than x®- This can also be seen from Figs. 1 and 2. 

The X® probability levels have been tabulated by Fisher (1941) for a = O-Ol, 
0-02, 0-05, O'lO, 0-90, 0-95, 0'98, 0*99 and are given to three decimal places. 
These levels were used in the interpolation.* The accuracy of the interpolated 
values obtained by the eight methods detailed below was checked either from the 
Tables of the Incomplete Oamma Function (Karl Pearson, 1922) or from Tables 
or Statisticians and Biometricians, Part I, Table XII (Karl Pearson, 1930). 

The greatest deviations, d^{m = 1,2, obtained using in all eight different 
methods, are given for v = 3, 5, 9 and 18 and for intervals of a: (0-01, 0-02), 
(0-02, 0-06), (0-06, 0-10), (0-90, 0-96), (0-96, 0-98) and (0*98, 0-99) in Table 1. 


Method 1. 

X® = .4 + Bloga (a<0'10), x* = ^ + (a>0‘90). 

Method 2. 

■m(x) = B log a (a<0'10), u{x) Blog{l — a) (a>0*90). 

Method 8. u{x‘) — A + BU. 

Method 4. 

u{x^)—U = A + Bloga (a<0'10), u{x^)--U - A + B\og{l~a) (a>0-90). 

Method 6 . «(x) — A + BU. 

Methods. X® = ^ (a<0'10), 

X® = ^ + Blog(l— a) + (71og®(l — a) (£z>0-90). 

Methods. u{x^) = A-i- BU GU^. 

Methods. uix) — A + BU -vCU^. 

The best linear interpolation is that provided by method 6 and the best 
quadratic one is given by method 8. Both these methods are those suggested by 
the general approach. 


* Certain additional levels are given in a recently published table computed by Catherine M. 
Thompson (1941). 
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Table 1. Greatest deviations in the interpolation of x® and x- 


V 

Ti 

72 

Interval 



l-^il 







i«,i 

\K\ 

3 

1-63 

4'00 

(0'02, 0-05', 

0'02 997 

0-245 

0-006 

0-007 

0-011 

0-012 




0-001 




(0’06, O' 10) 

0-07 890 

0-490 

15 

8 

10 

1 

4 

4 

3 

0 




(0'90, 0'95) 

0'92 810 

7-000 

4 

3 

27 

14 

6 

1 

0 

1 




(0'96, 0'98) 

0'97 071 

9-000 

5 

33 

36 

10 

0 

0 

0 

2 




(0'98, 0'99) 

0-98 827 

11-000 

1 

13 

34 

6 

1 

2 

1 

2 

6 

1'26 

2'40 

(O'Ol, 0'02) 

0-01 353 

0-632 

8 

5 

6 

3 

2 

6 

0 

0 




(0'02, 0'05) 

0-03340 

0-949 

23 

13 

16 

3 

6 

6 

1 

0 




(0'06, O'lO) 

O'Oa 160 

1-285 

19 


11 

0 

3 

4 

0 

1 




(0'90, 0'96) 

0-92 808 

10-119 

11 

32 

26 

12 

5 

3 

1 

1 




(0'96. 0'9«) 

0-96 544 

12-017 

13 

39 

32 

12 


2 

1 

0 




(0-98, 0-99) 

0'98 579 

14-230 

5 

18 

17 

6 

4 

2 

1 

1 

B 

0-94 

1'33 

(0-01, 0'03) 

0-01 060 

2-121 

4 

2 

2 

2 

0 

1 

1 

■0 




(0'02, 0'06) 

0-03 462 

2-970 

34 

21 

19 

0 

6 

6 

0 

1 




(O'Ofi, O'lO) 

0'07 705 

3-818 

33 

24 

17 

6 

6 

8 

1 

0 




(0'90, 0'95) 

0-93 312 

16-000 

19 

38 

17 

11 

6 

4 

0 

0 




(0'95, 0'98) 

0-96 483 

18-000 

22 

46 

31 

12 

7 

4 

1 

1 




(0'98, 0'99) 

0-98736 

21-000 

8 

18 

14 

5 

4 

2 

1 

0 

18 

0'67 

0'67 

(O'Ol, 0'02) 

O'Ol 167 

7-200 

14 

9 

5 

2 

1 

4 

1 

1 

■ 



(0'02, 0'06j 

0-02 793 

8-400 

47 

32 

20 

2 

6 

7 

0 

1 

■ 



(O'OS, O'lO) 

0'07482 

10-200 

48 

34 

18 

6 

6 

11 

I 

0 

■ 



(0'90, 0'96) 

0'93 159 

27-600 

34 

52 

26 

10 

6 

7 

1 

1 

■ 



(0'96, 0'98) 

0-96 255 

30-000 

34 

66 

28 

10 

6 

6 

1 

1 

I 

■ 


(0'98, 0'99) 

0-98 689 

33-600 

16 

26 

13 

6 

4 

3 

1 

1 


For methods associated with subsoripts to d, see p. 269. 


Table 2, Greatest deviations in the interpolation of t 


1 



a 

True t 


l-Ssi- 




l«el 

m 

l«8l 

3 

oo 

(0-005, 0-01) 

0-00 892 

6-196 

0-039 

0-040 

0-035 

0-020 

0-011 

0-008 

0-007 

0-004 



(0-01, 0-026) 

0-01 430 

3-989 

64 

52 

42 

21 

16 

7 

6 

2 



(0-025, 0-05) 

0-03 261 

2-848 

28 

24 

10 

7 

8 

6 

8 

0 



(O-OSO, 0-10) 

0-07 286 

1-954 

25 

23 

11 

4 

7 

3 

3 

1 

6 

3-00 

(0-005, 0-01) 

0-00 714 

3-413 

11 

8 

5 

3 

2 

1 

1 

I 



(0-01, 0-026) 

0-01 517 

2-820 

18 

14 

6 

3 

2 

1 

1 

1 



(0-026, 0-05) 

0-03 874 

2-128 

8 

6 

1 

1 

2 

2 

0 

1 



(0-050, 0-10) 

0-06 820 

T-719 

9 

8 

1 

1 

1 

1 

0 

0 

10 

1-00 

(0-006, 0-01) 

0-00 625 

3-038 

6 

4 

1 

2 

1 

1 

1 

1 



(0-01, 0-026) 

0-01 639 

2-476 

S 

11 

1 

1 

1 

1 

1 

1 



(0-026, 0-06) 

0-03 844 

1-972 

4 

3 

2 

I 

1 

1 

0 

0 



(0-050, 0-10) 

0-07 246 

1-581 

1 

4 

6 

0 

1 

1 

0 

0 


For methods associated with subsoripts to S, see p. 271. 
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5. The i-PBOBABniry punction 

The standardized form of the f-probability function used in Fig, 3 is 

(8) 

V being the number of degrees of freedom. The values of and yj and 

6/(v— 4) respectively. 

Percentage levels for t have been tabulated by Fisher (1941) for a = O-OOS, 
0-01, 0*025, 0*05, 0*10, ... and are given to three decimal places. The level here 
defined as a is half the figure given in Fisher’s table, i.e. 


a = 



( 0 ) 


The values of t used in checking those obtained by interpolation are taken 
from Tables of the Incomplete Beta-Function (Karl Pearson, 1934), where 

4(P. O') = a, 

1 ( 10 ) 

and 

p being the number of degrees of freedom. 

The greatest deviations found in the following eight methods are denoted 
by (m = 1, 2, 8), and are given in Table 2 for r = 3, 6 and 10 and for the 
intervals (0*005,0*01), (0*01,0*026), (0*026,0*05), (0*05,0*10). 


Method 1. 

u{t) = A + BU. 

Method 2. 

u{t) — U = A + B log a. 

Method 3. 

t = A-\-B log a. 

Method 4. 

u{t) = AV -i- BU\ 

Method 5. 

u{t}^ A -{-BU-\-CUK 

Method 6. 

t ~ A + B log a + C log^ a. 

Method 7. 

u{t) -U = A + B\oga+ Clog^ a. 

Method 8. 

u(t)^AU + BW + GU\ 


Prom the methods that make use of only two probability levels, method 4 
is the one that gives the highest accuracy and from those that make use of three 
probability levels, method 8 is the best. These two methods are those suggested 
by the general approach. 


6. The n on beta-distbibution 
This variable, which is related to R. A. Fisher’s z, is defined as 


V = 


Sj __ V2 


( 11 ) 
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where 8^ is a sum of squares of normal variates based on degrees of freedom and 
8^ an independent sum of squares based on degrees of freedom. The elementary 
probability law of v is 

p(u) = {£(K (12) 


Table 3, Ormiest deviations in the interpolation of v 


B 

1 








l<5.l 



l^cl 

1^,1 

1 

1 






0-00043 



0-00007 

0-00006 







(O'Ol, 0-05) 

0-02 607 

0-69000 

296 


34 

36 

■7 

27 

1 





(0'05, 0-10) 

0-06 901 

0-66000 

79 

5 

28 

14 

6 


1 

8 

30 

-0-61 

+0-26 

(0-006, 0-01) 

O'OO 769 

0-63000 

26 

4 

7 

16 

9 

5 

8 





(0-01, 0-05) 

0'02 711 

0-59000 

228 

36 

24 

35 

4 

13 

1 





(0-06, 0-10) 

0-06 646 

0-64000 

67 

4 

16 

16 

2 

5 

2 

16 

30 

-0-28 

-0-12 

(0-005, 0-01) 

0-00666 

0-41000 

31 

3 

3 

8 

9 

7 

8 





(0-01, 0-05) 

0'02 826 

0-47000 

240 


22 

42 

3 

9 

3 





(0-05, 0-10) 

0'07 463 

0-52000 

78 

9 

1 


1 

9 

1 

8 

15 

-0-33 

-0'26 

(0-005, 0-01) 

0-00677 

0-30000 

63 

19 


13 

3 

3 

3 





(0-01, 0-05) 

0-02 619 

0-37000 

418 

128 

114 

67 

11 

22 

2 





(0-05, 0-10) 

0-06 972 

0-44000 

126 

27 

16 

29 

2 

18 

3 

4 

6 

-0-16 

-0-77 

(0-006, 0-01) 

0-00796 

0-09000 

80 

32 

58 

29 

16 

5 

6 





(0-01, 0-05) 

0-02 723 

0-16000 

838 

375 

634 

181 

93 

19 

12 





(0-06, O' 10) 

0-07 421 

0-23000 

— 

278 

123 

144 

78 

33 

8 

11 


lor methods associated with subscripts to b, see p. 273. 


Low values of v correspond to high values of z. The standardized form of v 
used in Tig. 4 is ^ 

V 



(13) 


and the values of and 72 are 


71 = 

72 = 


’^ 1-^2 l( Hh + h + ^)\ 

ri + Va+4V\ ViUj /’ 

12 ()i|(ri + 2 ) + v\{v^+ 2 ] - 21^1^2(1^1 + + ^)} 
!' i )^ 2(>^1 + V 2 + 4 )( Vi + U 2 + 6 ) 


(14) 


The cases considered here are those most generally met in tests of significance 
with and therefore < 0 . y^ is sometimes positive and sometimes negative. 

Tables giving values of v fora=0-005, 0 - 01 . 0-025, 0-05, 0-10, 0-26 and 0-60 
to five decimal places have recently been published (Catherine M. Thompson, 
1941). The corresponding upper probability levels can be obtained by entering 
the tables with and transposed and taking l-u for v. 
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The values of v used in checking the interpolation are taken from the Tables 
of the Incomplete Beta-Function (Karl Pearson, 1934). 

The greatest deviations (m = 1, 2, 7), found in the following seven 
methods are given in Table 3 for the following pairs of values of and Vg: 
(4,20) (8,30) (15,30) (8,16) (4,6) and for the intervals (0-005, 0*01), (0-01, 
0-05), (0-05,0-10) of a. 

Method 1. 

v = A + B\oga. (a<0-10), -w = ^ + .Blog(l-a) (a>0-90). 
Method 2. 

u{v)-V = A + Bloga (a <0-10), m(u)- Z 7 = ^ + J51og(l-a) (a >0-90). 

Methods. u{v)==A^-BU. 

Methods. V = A^Bloga+Clog^oL (a<0-10), 

V = .d + 51og(l — a) + 6'log2(l— a) (a >0-90). 

Methods. u{v)—U = A + Bloga + Clog^oc (a<0-10), 

u{v)—U = J. + jBlog(l — a) + (71og®(l — a) (a>0-90). 

Method 6. u{v) A + BV + GU^. 

Method 7. u{v) = .4 + J?C7+C'172 + DC7». 

Here also the best linear method and the best quadratic one are those sug- 
gested by the general approach, namely, methods 3 and 6. Method 7, a cubic in 
U, was used only because the great accuracy of the tabulated D-function justified 
the computation. This cubic interpolation gives errors of the order of 0-0001 
even for numbers of degrees of freedom as small as = 4, Vg = 5. If the interval 
between a = 0-01 and 0'06 were broken into two parts at 0-026 (a level given in 
the Thompson tables) the corresponding errors would be considerably reduced. 

7. Some numerical examples, using linear interpolation 

(a) Interpolation for a •)^ level. Suppose that we calculate the upper 2^ % 
level (a = 0-976) for with v ~ 8 degrees of freedom, using tabled values of the 
upper 2 % and 6 % levels. We require the 2 %, 2| %, and 5 % levels of the stan- 
dardized normal variable as well as the two levels. The relevant data are shown 
in the following table; as pointed out above, there is no need to calculate the 
standardized form of either x or 


a 

Ua 


Xa 

0-98 

2-0637 

13-388 

3-669 

0-976 

1-9600 

? 

? 

0-95 

1-6449 

11-070 

3-327 
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By linear interpolation, i.e. using method 3 of p. 269, we have 

, 1-9600- 1-644:9 

xU = 11-070 + (13-388- 11-070) 

= 12-857. 

Interpolating similarly for x, i-e. using method 5 of p. 269, we find 
;'i!Q -976 ~ 3-683 or ;\;o-b 76 ~ 12-838. 

The correct value taken from Miss Thompson’s table (1941) is 

Xl„, = 12-8326. 

It is seen, as expected on theoretical grounds and as evidenced in the comparisons 
of Table 1, that method 6 is the more accurate of the two. 

(6) Interpolation for a t level. Suppose that we calculate the value of t corre- 
sponding to a = 0-0125 (as defined in equation (9)), with n = 6, using the tabled 
levels for a = 0-005 and 0-026. The data required are shown below, the values of 
t being taken from the table on p. 300 of this issue. 


a 


ta 


U„ 

0-006 

- 2-6768 

- 3-7074 


6-6347 


- 2-2414 



6-0239 


- 1-9000 

- 2-4469 


3-8416 


Again, it is not necessary to calculate the standardized values of for even 
when using the relation of method 4, which assumes 

^ = A + BU% (15) 

the transference of o', to the right-hand side of the equation will only modify 
the constants A and B whose values are not directly determined in the inter- 
polation process. 

Using method I (t a linear function of U), it is found that 

^o-om ~ ~ 3-023. 

Using method 4, [tlU a, linear function of U^), it is found that 

^0-0125 ~ ~ 2-979. 

The correct value is - 2-969. As can be seen in Fig. 3, from the stretch of the 
curve for r = 6 between a'= 0-975 and 0-995, the interval chosen is too long for 
satisfactory linear interpolation. The use of formula (16) improves matters, but 
is still not satisfactory. With Fisher’s tables we could, of course, interpolate 
between the levels a = 0-010 and 0-026; doing this with method 4, it is found that 

^0 0125 ~ — 2-971 


a distinctly better value. 
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(c) Interpolation for a beta-distribution percentage level. Take the case = 8, 
1^2 = 30 and suppose it is wished to find the 2|- % level from a knowledge of the 
1 % and 5 % levels. The data required are as follows, and ?;o.o 5 being taken 
from Miss Thompson’s tables. 


a 



0-01 

2-3263 

0-54170 

0-026 

1-9600 

? 

0-05 

1-6449 

0'62332 


Then, using linear interpolation, i.e. method 3 of p. 273, we have 

1'9600-- 1-6449 

%o25 = 0-62332+ (0-62332-0-54170) X 27326^^1:6449 

= 0-58568. 

The correct value taken from the same tables is 0-68582. It wiU be seen from 
Fig. 4 that the intervals a = 0-006, 0-010, 0-025, 0-050, 0-100 of Miss Thompson’s 
tables are likely to form a satisfactory framework for aubtabulation if this is 
needed. 


8. STrSTABtrLA-TIOK OY EXISTITSTG TABLES 

These methods have been used to produce the following enlargements of 
existing tables; but it is not at present possible to arrange for their publication. 

(а) Table of percentage levels. Method 7 (p. 269) was used, as it is almost as 
accurate as method 8 and less laborious. The table calculated gives ^ to 3 
decimal places for — 1 (1) 30, andfor a = 0-010 (0-002) 0-020; 0-020 (0-005) 0-050; 
0-05 (0-01) 0-10; and for the corresponding levels at the upper end of the distribu- 
tion, i.e. for a' = 1 — a. 

(б) Table of t percentage levels. Method 8 (p. 271) was used for v = 3, . . ., 6, and 
method 6 for v = 7 (1) 30; 40, 60 and 120. Exact levels were calculated for r = 1, 2. 
The tables were computed to 3 decimal places and for 

a = 0-005(0-001)0-010; 0-0100(0-0025)0-0250; 0-025(0-005)0-050; 0-05(0-01)0-10. 

9. Interpolation fob the probability integral a, given . 

As we have already mentioned, the variable can be expressed as a poly- 
nomial in Ua- This suggests that interpolation for U^, given is as easy and as 
accurate as the interpolation discussed above. Hence to interpolate for a, given 
the value of and certain tabled levels ..., we first interpolate for 

and then find the value of a from appropriate tables of the normal probability 
integral, e.g. Tables for Statisticians and Biometricians, Part I, Table II. 
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10. COHOLUSION 

It has been shown how accurate values of the probability levels of a statistical 
variate may be interpolated between standard tabled values by using the stam 
dardized normal variate as auxiliary. For many purposes linear interpolation is 
adequate; for others a second order Lagrangian formula may be preferred. The 
accuracy of the result depends, of course, on the closeness of the actual prob' 
ability law to the normal law and on the size of the intervals between the tabled 
levels. The method has been illustrated on examples from the t and v (beta) 
distributions. 

The method has been used to provide a subtabulation of existing table.? of 
percentage points for and t, but it is not possible to have these tables printed 
with the present contribution. 

Finally I should like to express my thanks to Dr B. L. Welch of University 
College, London, for originally suggesting the problem to me. 
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PARTIAL RANK CORRELATION 

By M. G. KENDALL 


1. In interpreting an observed dependence between two qualities we are 
constantly faced with the question whether an association (correlation) of A 
with B is really due to the associations (correlations) of each with a third quality 0. 
This has led naturally to the theories of partial association and correlation, 
which attempt to decide the matter by the consideration of subpopulations in 
which the variation of C is eliminated. An analogous problem arises in ranking 
work but, so far as I know, has not previously been considered. For example, 
if a number of men are ranked according to mathematical and musical aptitude 
and there appears a significant rank correlation, it is natural to inquire whether 
this may be attributable to the correlation of both with some more fundamental 
quality such as intelligence. The object of this paper is to propose a coefficient 
of partial rank correlation which has a natural meaning and may be found useful 
for investigations requiring this type of decision. 

2. As a preliminary it may be worth examining what can be done in this 
direction with the Spearman rank correlation coefficient p. If there are three 
rankings denoted by 1, 2 and 3, we may find the three coefficients Pn, Pis, P23. 
It is tempting to apply to these coefficients the formulae of product moment 
partial correlation such as 


n — P'a~PviPn 

ii-ph)H^-p%r 


( 1 ) 


and to define P23.1 the partial rank correlation of 2 and 3 ‘when 1 is constant’. 
There is clearly very little justification for such a procedure, and it is far from easy 
to explain just what i means. In fact, the only defence of formula (1) that 
can bear critical examination is, I think, that it is an approximation to a second 
possibility, as follows; 

3. There can be no such thing as a rank correlation in a continuous population 
(the members of which are not even denumerable) but we can speak with genuine 
meaning of a grade correlation. A well-known result due to Karl Pearson states 
that in a normal bivariate population with correlation Pj, the grade correlation 


Py is given by 


Pp = 2sin-pp. 


The Spearman coefficient p may be regarded as a sample grade correlation. If, 
therefore, we take p as an estimate of Pg we may find Pp from (2 ) . For three rankings 
we may then obtain the three values of apply the ordinary product moment 

. Tra: , 

partial formula, and so obtain a partial coefficient. Since x and 2 sin do not 
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differ by more than a small amount for | a: j < 1, we might even apply formula 
(1) direct to the values of p without bothering to transform them into by 
equation (2). 

4. Such a procedure, again, is open to fairly obvious objections. Apart from 
the all-too-faoile assumption of normality and the error involved in using Spear- 
man’s p from a small sample to estimate the grade correlation in a parent, the 
fact remains that we arrive, not at a partial rank coefficient, but at an estimate 
of a partial product moment coefficient in a normal population. 

Perhaps there are cases where this is a reasonable objective based on reason- 
able assumptions but it is evidently unsatisfactory for general ranking purposes. 

5. In a previous paper (1938) I defined an alternative coefficient of rank 
correlation t which may be generalized to include the case when pairs of in- 
dividuals are compared separately (Kendall & Babington Smith, 1940). It will 
be convenient for present purposes to redefine t in a sHgbtly different manner 
so that the results obtained below may again be immediately generalized to the 
case of paired comparisons. Consider the two rankings of six 


1: 1 4 3 2 6 5 

2: 3 2 4 1 6 6. (3) 


There are 



= 16 possible pairs of ranks in each ranking, viz. 12, 13 16, 23, 


24, ..., 56. We write them down as in expression (4) below. Any order of the 
pairs will serve, and it is immaterial whether any pair is written as ab or bai but 
for practical convenience they may be written in the natural order indicated in 
the last sentence but one. This arrangement I call the recorded order. 

We then consider the occurrence of each pair in the ranking I . If a pair occurs 
in that ranking in the order in which we have recorded it, we write a plus below 
the recorded order underneath the pair concerned; in the contrary case we write 
a minus. Banking 1 of expression (3) will then give 


Recorded order: (12) (13) (14) (16) (16) (23) (24) (26) (26) (34) (36) (36) (46) (46) (66) 
Ranking 1: "k-k-kd-i- — "d-'j- — — 

(4) 


Here, for example, the pair (15) occurs in that order in ranking 1 and so is 
denoted by a -f , whereas the pair (24) occurs as 42 and is denoted by a — . 

Consider now ranking 2. The members of ranking 1 which are ranked 1,2 
correspond to members in ranking 2 ranked as 3, 1. This is in the reverse order 
to that of the pair 13 in the recorded order, so (starting a new row of signs corre- 
sponding to ranking 2) we write a minus under the recorded pair (12). Similarly, 
the pair in ranking 2 corresponding to 15 in ranking 1 is 36. This is in the same 
order as the recorded pair, so we write a plus below the existing pins under the 
recorded pair (16). The pair in ranking 2 corresponding to 23 in ranking 1 is 41. 
This is in the reverse order of the recorded pair, so we write a minus below the 
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recorded pair (23) in the row of signs corresponding to ranking 2. And so on. 
This takes rather a long time to explain but the process is really very simple. 
The array corresponding to expressions (3) is then 

Recorded order: (12) (13) (14) (16) (Id) (23) (24) (26) (26) (34) (35) (36) (46) (46) (56) 

Ranlring 1: 4- + + + + _ _ + + _^..|. 4 . + _ 

Ranking 2: — + — + + — _ + + 

( 6 ) 

Now in expression (5) there are eleven cases in which both rankings have the 
same sign and therefore 16 — 11 = 4 in which they have the opposite sign. The 
coefficient t is then given by 

_ ll-4_ 7 
15 "15’ 


Generally, if there are, in two rankings of u arrayed as above, cases of the same 
sign and of opposite sign 

n{n~l) 


^8^ iS, 

n(n~l) n{n-l)' 


(6) 


If we arrange ranking 1 in the natural order 1, . . ., w, then every case in which 
there are the same signs in expression (5) corresponds td a case in which pairs in 
ranlcing 2 are in the natural order; and every case of different sign to one in which 
the pairs in ranking 2 are in the reverse of the natural order. The definition of (6) 
thus accords with the one originally given in my 1938 paper. It is often con- 
venient to take one ranking to be the natural order 1, . .., so that the first row 
of signs hi (6) are all positive. For instance, on rearrangement of the rankings in 
(3) we have 

1: 1 2 3 4 5 6 

2: 3 1 4 2 6 5 (7) 

and the array of paired comparisons becomes 

Recorded order: (12) (13) (14) (16) (16) (23) (24) (26) (26) (34) (35) (36) (45) (46) (66) 
Ranking 1: + -I- + + + + + + + + + + + + -I- 

Ranking 2: — + — + + 4- + + + — + + + + “ 

( 8 ) 


Here again /S^ = 11, fSj = 4, 

6. Consider now three rankings, of which the first may be taken to be the 
natural order 1, to, for example. 



1 : 

2 ; 

3: 


1 2 3 4 5 

3 1 4 2 6 

4 2 16 3 


6 

6 

5 


(9) 
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The corresponding array of pairs is 


Kecorded order : (12) (13) (14) 

(16) 

(16) (23) (24) 

(26) (26) 

(34) 

(36) 

(36) 

(46) 

(46) 

(66) 

Ranking 1: 

+ + + 

+ 

+■ + + 

+ + 

+ 

+ 

+ 

+ 

+ 

+ 

Banking 2 : 

- + - 

+ 

+ + 4- 

+ + 

— 

+ 

+ 


+ 

— 

Ranking 3: 

— — 


+ - + 

+ + 


+ 



— 

+ 

(lOi 

For the coefficients r we 

have 







\ X v; 




(as above) = 

7 

16’ 










9-6 

3 









^13 

15 

16’ 










7-8 

1 









■^23 

“16 

16’ 








in the last case Si being as usual the number of cases in which pairs in rankings 
2 and 3 have the same sign. 

Consider now the fourfold table setting out the occurrences of + and — 
signs in the rows of expression (10) corresponding to rankings 2 and 3; 


Kanking 2 



+ 

- 

Total 

+ 

6 

6 

11 

- 

3 

1 

4 

Total 

9 

6 

15 


( 11 ) 


Here, for example, there are six cases in which pairs of ranks have the same 
(positive) sign, five in which ranking 2 is negative while ranking 3 is positive, 
and so on. 

Generally, if for three rankings of n the table is of the form 


Banking 2 



+ 

- 

Total 

+ 

a 

6 

Oi'\‘h 

- 

0 

d 

c+d 

Total 

a + c 

6+d 

N. 


( 12 ) 
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I define the partial rank correlation of 2 and 3 on 1 as 

j „ ad— be 

“ + V{(a + b)(c+ d) {a + c)(b + d)} 


(13) 



(14) 


where N — 



and is the ordinary mean square contingency for the four- 


fold table. 

In the particular case here considered the coefficient is 


6-16 

" V(‘i. 11.9.6) 


-0-186, 


as compared with r 23 = — 0-067. 

In the case when ranking 1 is not the natural order 1, n the same prin- 
ciples apply, but in considering the communalities of 2 and 3 we count as con- 
tributing to the a-cell in (12) the pairs which are themselves of the same sign and 
are also of the same sign as the term in ranking 1 ; and so on. 

7. The partial r defined in equation (13) is a coefficient of association in 
the 2x2 table suggested by Yule (1912). When the attributes of the table are 
independent and only in this case it is zero. It is -I- 1 only if b and c are both zero 
(i.e. if the two rankings agree in all pairs and hence are identical) and - 1 only 
if a and d are both zero (i.e. if the two rankings disagree in aU pairs and one is 
the reverse of the other). In this latter property of attaining unity only when 
diagonally opposite cells are both empty it differs ftom two other coeffi-cients 
proposed by Yule.* 

Thus partial t as defined is a measure of the association of agreements of the 
rankings 2 and 3 when compared in pairs with ranking 1. From this viewpoint 
it will be seen that the use of the word ‘partial ’ conforms to that in the theories 
of association and correlation. The partial associations of A and 5 in a third 
population containing G and y are those oi AG and BC or of Ay and By. The 
partial association of ranks is that of pairs of agreements of 2 1 and 3 1 . The partial 
correlation of 2 and 3 when 1 is constant is paralleled by the partial rank correla- 
tion of 2 and 3 when 1 is in the natural order 1, ..., n. If partial t is unity in 
absolute value the rankings coincide or one is the reverse of the other, whether 
they agree with ranking 1 or not, so that the coefficient fulfils its proper function 
of measuring the relationship between 2 and 3 independently of the influence of 1 . 


* Yule himself arrived at the ooeflScient by considering product-moments in a 2 x 2 table. 
Karl Pearson & Heron (1913) mistook Yule’s intention and thought the ooefSoient was proposed 
as an estimate of the correlation in a normal population whose frequencies were given by a double 
dichotomy in the 2x2 table. Their long memoir is mainly devoted to advocating the alternative 
merits of tetrachoric r. The two things, as Yule has emphasized, are quite different. 1 mention 
the point to make it clear that the Pearson- Heron criticisms of Yule’s coefficient, even if not mis- 
founded, do not affect the above work, since I use the coefficient purely as a measure of association 
in the fourfold table. 
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8. We may establish the remarkable result 

^a3~'^ia'^x3 

In fact, from expression (12) we see that 

(as + fe) — (o + d) 

(a+c)~(b + d) 


{a+d)~{b + c) 

'^ 23 - 2f 

Eemembering that N = a + b + 6 + d'we have 
1 -'^12 (“+*)(« +‘^)- 


' (16) 


^23 - 'TiiTn = ^ [(a + 6 + C + d) {(o + tZ) - (6 + c)} 

— {(ft + &) ~ (c + d)} {(ft + c) — (6 + (?)}] 
4 


Equation (16) follows at once from the definition of partial r in equation (13). 

The appearance of the product-moment type of relation between total and 
partial correlations is surprising. There was no reason to expect that partial t, 
which is a pure function of disarrangements in rankings and is not expressible 
algebraically in terms of the ranks, should bear any analogy with the partial 
correlation of variates; but since it does so, we are evidently fortified in regarding 
partial t as a convenient measure of rank correlation. 


Example. Ten men are ranked according to (1) intelligence, (2) mathematical ability, 
(3) musical ability. The ranliings are: 

1: 123466789 10 

2: 146627398 10 

3: 4136267 10 9 8 

It will he found that rjj = 0'644, Tj, = 0-644, Tjj = 0-666. Thus mathematical and 
musical ability are positively correlated. The question is, can this correlation be attributed 
to the correlation of both with intelligence t 


We find 


_ 0-666 -(0-644)® 

1 ^- 644 )® " 


= 0-24. 


The conclusion would be that although part of the total correlation is due to the correlation 
of both with intelligence, part of it is not. We cannot attribute the whole of the observed 
(total) correlation between mathematical and musical ability to the interference of common 
correlation with intelligence. 
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9. The same methods are immediately capable of extension to paired com- 
parisons. In fact the array of type (10) is the array of paired comparisons for all 
the possible pairs of ranks and when there are no constraints of the ranking 
character (i.e. such that ii A and B-^0 then must A-^G) the coefficient t 
becomes a measure of agreement in paired comparisons (of, KendaU & Babington 
Smith, 1940). We could then construct measures of partial agreement by the 
same formulae in cases where it was suspected that there were mutual influences 
at work between three observers, as for instance if it were suspected that com- 
munity of preference between two children was due to community of both with 
one of the parents. 

10. In conclusion, it may perhaps be desirable to point out that although 
partial r is defined by equation (14) in terms of for a fourfold table, its signi- 
ficance cannot on that account be tested in the Type III distribution with one 
degree of freedom. I have not yet succeeded in finding expressions for the sampling 
distribution of partial r but it seems clear that the Type III distribution will not 
be reproduced in the ranldng case, at least without some substantial modification, 
even when the rankings are independent; for there exist correlations between the 
signs given by any ranking in the recorded order. If, for example, (13) and (23) 
are positive so must be (13), whereas if (13) and (23) are positive (12) may be 
either positive or negative. The units in the fourfold table cannot therefore be 
regarded as allocated at random and the type III distribution will probably not 
hold, I hope to return to this subject on a later occasion. 


REFEBENOES 

Kbndail, M. G. (1938). A new measure of rank correlation. Biometrika, 30, 81. 
Kenuau:,, M. G. <te Babington Smith, B. (1940). On the method of paired corapariaons. 
Biometrika, 31, 324. 

Pbabson, TCatct, & Hbeon, D. (1913). On theories of association. Biometrika, 9, 159. 
y-DXB, G. Udnv (1913). On the methods of measuring the association between two attri- 
butes. J.B. Statist, Soo. 75, 679. 


Biometrika xxxn 


19 



INEQUALITIES FOR MULTIVARIATE FREQUENCY 
DISTRIBUTIONS 

By C. E. V. LESER 


Given a frequency distribution y of & single variate x, witb arithmetic mean 
£c = 0 and vritb standard deviation. (r,Teh.ebyclieff’s (1867) vrelbknown, inequality 
presents a lower limit for the ratio of the frequency of all values, of x between 
“Ao* and Act to the total frequency, where As 1. In the case of the special class 
of frequency distributions for which y{x)-\-y[-~x) is a non-increasing function of 
I a: I , this inequality can be substantially improved to another limit which applies 
to all positive values of A, as already Gauss (182S) has proved. Various authors* 
have generalized these theorems by modifying the assumptions made with 
regard to the frequency distribution, by introducing moments of higher order 
than the second or by extending some of the results to bivariate functions, In 
the present analysis only moments of second order are used, but the results apply 
to frequency functions of any number of variates. 

Suppose ?/(xi, to be a frequency distribution of % variates, with total 
frequency equal to unity, arithmetic mean at the origin and with standard 
deviations o-^, ,..,(r„. Let P be the frequency of all combinations of 
for which ' x 

w \K<^J 


Write 




n 


so that Ag and erg are the harmonic averages of A^, . . . , A^ and (rf , , . . , respectively. 

We also write 

A . iT . 

{i = l,...,n), 


n 


Ai <r,. 


'to 




-i-l +...+ 




80 that P is the frequency of all values of . . . , for which g AqO-^. Further- 

more, we define A[Rg) as the average value of y for all those values of Xi, . • 
for which B has a fixed value B^, and therefore 


A(Po) — 


..Aydxj^...dx^ 

„ iBSt J 

I • » . dxi * . I dXfi 

Jn=R, ^ ” 


♦ E.g. K. Pearson (1919), B. H, Camp (1922), S. Karumi (1923), 0. D. Smith (1930), P. 0. Berge 
(1937). 
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We can now state the following : 

Theorem. Assuming the frequency distribution to be such that A(jR) is a 
non-increasing function of Ji for we have one of the following three 

sets of inequalities, according to the value of a:: 

L 


(i) 


Ao^l 

P>0, 

(ii) 


l^Ao 

1 

p>l_A 
= A§- 

II. 




(i) 




(ii) 


(,+ 2 ) 


(iii) 



p> J:. 

“ A®' 

III. 




(i) 




(ii) 1 

(4i) 

i/"/n + 2\t , 2 V/” 


(iii) 


(„ + 2) 

/C® 

(iv) 


k^Ao 

P> 1--^. 

- Ag- 


Proof. We introduce the new system of co-ordinates 


cos ti cos ^2 cos tg . . . COS 
aij == sin cos cos tg . . . Cos t„_i, 


Xg = /tgTg jR sin cos tg . . . cos tn-ii 


^n—l> 

so that y(Xj^, ...,Xn)dx^... dx^ = 

where z{R,ti,--’, in-i) = • • •. <«-i) cos cos® tg . . . cos”-® «„_i. 
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We also write s = cTq ^n., 

and we are going to use the following abbreviating symbol 

f* /•iTr 

I f{^>t)dt= I I ...I ••• 

Then f” { B^'^z{R,t)dlidt = 1, 

f f Ji'‘-^z(B,t)dJtdt ^ F, 

Jn==oJ T 

and as J. (F) = const. z(Jt, t) dt, we have the condition that z{R, t) dt 
JT J T 

increasing function of JJ for i? ^ ks. Furthermore, let us write 

I t'j dt^ 

Jt 


G 


and later on 

Now start from the equation 
va 


O 

u = — 
n 


v*in/ 


’ 00 

r CO j 


J ajjsa— 00 ^ 

'aTn^-co 1 

Wn-^nJ - 


which can be written 

(-^ + - +^) - 

or s2 = Jj + J 2 , 

f R^+^z{R,t)dRdt, /2= f” f R^’-+^z(R,t)dRdt. 
Ji2=oJr 

According to the value of ks, three cases have to be distinguished. 


(a) KS 

For FgAgS 
Hence 


t . n ~\Vii 

{A,sr+~{\-p) 


or P S 1 — («:" — Aj‘) u. 

J z{R, t) dt ^ Q. 


CKb 

k^&\ . 

JjR-^O 


R^+^dR, 


'A 8 

and incidentally F^G ” R”^-^dR — A^u. 

Jr=o 


is a non- 


• • ) dx,i 
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To obtain a limit for we make use of the equation 

B^-XR,t)dRdt^l-P^Q\ ® R^--^dB 

J 2e=AosJ T J H=AtS 

or f” f B'^-^z{B,t)dRdt 

.[(A.8)»+g(l-P,]''’' 

R’^-^]^-lz{R,t)dt\dR. 

As the integrands ate nowhere negative and R nowhere smaller in the integral 
on the left than in the integral on the right, we can write 


« 

J 


E“[(A.s)"H~( 1-P)]''’‘ 


and therefore 


f R^+h{R,t)dRdi 

J T 

J(A,8r+|(i-p)]''" p . 

g (?- z(it!,t)dt dE, 

Jn=Aos L Jt J 


.feffj' 


.[(Ao3)”+|a-p)]’'" 


B-“AoS 


R^^^dR, 


J«..>-+=(>-n]'" [(J(A, .)«+„(! 

■“ '** FP2)e® ' 


n^l^{n + 2)u^^ « + 1 - 


Pgl+A; 




'^ + 2\"/<»+2) 


n 




p „ -[l/n 

(6) Aoa^/cs< (Aoa)«' + ^(l-P)J or P< 1 - (ac»- A j)w, Ao<k. 

As before, Zj^gGf R'^+^dR and PgAg^u. 

Jb=o 

Furthermore, we have the equation 

f R'^-^ziR,t)dRdt = l-P = g !"" E"-idP + ri-P-^(/c”-A^)s»'], 
Jp=a,Jt L J 

J“ J .R»-iz(P,t)dPdt = 

+ri-p 


l_p_,^(;c»-^Ans"l, 

w' J 
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and, as before, it follows that 

f” { B^+^z{R,t)dRdt>r R«+^[g-\ z{R,t)dt]dR 

R'^^HR + [ksY fl - P - - (/c’^ - Ajf) sA . 
J «-A.s L ^ J 

Hence s^^O f P"+idP + (ksY [ 1 - P - - (*•» - A'^) s’H , 

JR=0 L ^ J 


1 + P— {K»^-Ag)w], 

Ps(i-;^,) + («-;^ -<*)«. 

(c) /cs^A^s or AoSk. 

In this case, Ij need not be larger than 0, but 

4^(AoS)*(l-P). 

s2g(Aos)*(l-P). 

Therefore, if AgS 1, P ^ 1 - and if A, S 1, no significant limit exists. 

Afl 

For Xg ^ K, the problem has thus been solved, but the case A^ ^ x is more 
complicated; summarizing the results of (a) and (b), we have two alternative sets 
of inequalities. Write 


ffu) = AJm, ffu) = 1 - (x" - AJ‘) u, 



Mu) = l-tAg-it 

in + 2 
\ n 

\ n/<ii+2) 



_L 1 in 



+ ['h 

n + 2 

Then either 

Pi A, 

Pi A, 

Pi A, 

or 

PiA, 

P^A, 

Pi A- 


For different frequency distributions, G and therefore u may assume any 
non-negative value which is compatible with the condition P^ 1. We have to 
find the effective lower limit for P as a function of u and its minimum value 
which, as easily seen, is only larger than 0, if /ilO) > 0 or, what is the same thing, 
x> 1, 

/i> A> fi straight lines. As 


dfs 

du 



-vKn+i) 
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/g has a minimtim when 


and = 1 + - 


n + 2 

( 2 ' 

i(n+2)/re 

1 

_2/ 2 \ 

2/m 2 


n 

\n-l- 2 , 

1 


n\n + 2] 



H ^ \ 

2/n j 

n+2 

/ 2 ' 

s^ln 1 

1 2 ^ 

2 /n 1 

\n + 2j 


n ^ 


) Ai~^ 

U-f- 2 j 

• 1 


Indicating by f^j the co-ordinates of the point in which the curves and/j 
intersect, we find that ^ 

A. - (I)". 


/ n 

U + 2, 

n + 2/. 
2 \ 


U, fa and /4 have a point in conamon in which /4 is the tangent to/g; 


„ -'±±1 ^ f 1 n + 2K'^~X'^ 

n Ar”'+2 ’ •' ~ n /c”+2 ' 


It is also seen that the sign of both expressions ^ and U 234 - is positive 

/ 2 

OT negative according to whether A,, ^ ( k. 

\7t4* 2/ 

We have therefore four possibilities. In the following diagrams, the heavy 


Case 1 


Case 2 
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line shows the effective lower limit for P as a function of u (the upper limit is 
always equal to 1, of course), and its minimum gives the general lower limit for P . 
In these four cases, the following results are obtained; 


(1)] 




(2) • 

(3)J 

o 

Vll 


PSAs. 

(4) 

o 

All 


pm^)> 


and by substituting the proper values; 



In addition, we know that for Ag ^ /f, Aq ^ 1 : P g 1 — ~ . By rearranging these 

inequalities, we can bring them into the form in which they were given in the 
theorem, which is therefore proved. 

It may be remarked that k depends on the ratios between any two of the 
quantities Aji, 9ut it remains the same if Ai, ...,A,i change in the same 
proportion. 

Let us consider the sets of inequalities 1, 11 and III of the theorem separately. 
The most important case in which set I is relevant occurs when nothing about the 
frequency distributionis known except the averages and standard deviations of the 
variables, in which case we have to put k = 0. It provides a generalization of Toheby- 
cheff’s inequality which is obtained in the special case w = 1 in which Ao = A. 

If M. = 2, the inequality refers to the frequency of all points lying inside the 
ellipse which has the axes A^ ^2 A^ and it can be written in the following 


way; 





It is interesting to compare this limit with the one given by Berge (1937) for a 
rectangle which has its corners in the points with the co-ordinates ( + Acj, + Aog) : 


p> I f ~1~ f(i- - 


where r is the correlation coefficient of the frequency distribution. If r = 0, 
Barge’s limit equals 1 — 2/A^ and is therefore equal to the limit we get for the 
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ellipse with the axes Acr^, Acg which is inscrihed to the rectangle. On the other 
hand, if f = + 1, Berge’s limit becomes equal to 1 — 1/A^, and we have to choose 
the circumscribed ellipse with the axes to obtain the same limit. 

Hence, the limit given here is certainly not inferior to the limit given by Berge, 
if the two variates are independent, but it is not superior to this limit, if there is 
a perfect correlation. 

Set II is perhaps of more theoretical than practical interest. An intermediate 
value of K may, however, he realized, if there is sufficient information about the 
frequencies inside, but not outside, a certain n-dimensional interval. The in- 
equalities correspond to those given by Narumi (1923) for frequency distributions 
of one variate only, but having a more general meaning in so far as P may refer 
to multiples of other quantities besides the standard deviation. The first inequality 
seems the most interesting one of this set; it reduces to a special case of Narumi’s 
inequality if we put w = 1 : 

and for = 2 it may be written 




or 




1 


xf' K^) K^ijxi+ijxiy 

Set III generalizes Gauss’s inequalities. It reduces to the first two inequalities 
for those values of A^, . , . , A,^ (if any) for which a: = co. This is the case for all 
values of A^, ...,A„, if y(hxj_, ...,kxn)+y(~'hxi, -hxj is a non-increasing 
function of j A. j for any fixed values of % ... , x^\ especially if the function y 
decreases monotonically along every straight line radiating from the origin. 

The assumption /c = oo will be made throughout the following analysis. 
Gauss’s formulae are obtained by substituting -a = 1 ; 

In the case n = 2, the inequalities are also greatly simplified; 

A2 

A„S1;P^^, 


Aogl:PSl 


or 


_L+A-2;2;P> 

ArAr^' - 

J_+JL£2:P> I 
Af+Ar “ 


l/Af-H/Af 
4\A?^Ai 
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Retiiriiing to the case of an unspecified value of n, 'We shall often be interested 
in a limit for the frequency of all values of the standardized or original variables 
for which the distance from the origin is no more than a certain value, i.e. either 


>Ci ^4) ^ n 


<rt 


or 




In the first problem, we have 


Aj — ... — A„ — Aq — 


Jn’ 


and our inequalities can therefore be written 




jji g 22/™(n + : P S 1 - 


I 2 

\w + 2/ 


In the second problem 

AiO-i == ... == 

Hence, if 

p2 g + 2)('»-3>M 

2^''”(n + 


(r|+...4-er^ r np^ 

n ' L(^ + 2)((r|+...+tr|)J ’ 

o-!+...4-<ra / 2 

n ■ \n + 2/ p® 


Again, the insertion of special values for n will simplify the formulae. 

Finally, it may be at least of theoretical interest to consider the generalized 
Gauss limit, as obtained for /c = oo, as a function of n for fixed values of A,,, and 
to compare it with the generalized Tchebycheff limit which is obtained for x g 1 
and is independent of n. For some particular values of Aq and n, this is done in 
the following table: 



1 

2 

3 

1 

0-677 

Gauss limit 

0-889 

0-961 

2 

0-500 

0-876 

0-944 

3 

0-467 

0-864 

0-940 

4 

0-423 

0-868 

0-936 

Any value 

0 

Tchehyoheff limit 
0-760 

0-889 
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Furthermore, since for Aq g ^ 

lim 

l-^CO \' 


n 


and since 


lun 1 

71-^00 " 1 " 

lim 1 • 

n— >ooL 


:1 Aifg lim 


/ 2 1 1 _ 
\»+-2j A|_ 


n->oo 4 “ 2 
1 - 


it is seen that with an increasing number of variates, 
ally its superiority over the Tchebyoheff limit and 
two limits tends to vanish. 


= 0 , 

'Ar 

the Gauss limit loses gradu- 
the difference between the 
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THE MODE AND MEDIAN OF A NEARLY NORMAL 
DISTRIBUTION WITH GIVEN CUMULANTS 

By J. B. S. HALDANE, E.R.S. 


In the early years of biometry the mode and median of a distribution, and 
especially the latter, were regarded as being almost as important as the mean. 
Later Pearson and others developed the method of moments, and recently the 
eumulants, which are readily derived from the moments, have been widely used. 
Pearson (1895 and after) discussed the relation of the mean, mode and median 
in some of the skew frequency curves which he invented. He found empirically 
that for skew curves of Type III, namely, 


= + 

when jp is positive, Mode -Mean = 3 (Median— Mean) approximately. By fitting 
for a series of integral values of p he found 


Median -Mode 
Mean -Mode 


0-6691 + 0'009425-h 


Some later writers have taken the trisection as a general law. But as we shall 
see, it does not always hold, even approximately. So far as I know, general 
expressions for the mode and median in terms of the eumulants have not been 
given, nor have the conditions been stated under which Pearson’s rule holds 
approximately. 

Consider a variate Z, with distribution df = F{X)dX, and eumulants 

The algebra is simplified if we make the transformation x = {X — m)j(r, so 
that X has mean zero, and unit standard deviation, its eumulants being 0, 

72 , j where 7,,_2 = Thus is the rth measure of the deviation from 

normality of the distribution of Z. Now may become infinite for all values 
of r, or for aU even values of r, above a certain value. It may diminish indefinitely 
or remain small. In other cases it falls with r at first, but then increases without 
limit. Thus for Pearson’s Type HI distribution, whose equation has been given 
above, 

= (f-l)!(p+l)y , and y, = (r+ 1)! (25 + l)-iq 

And in the case of any estimate of a statistic, such as the mean, variance, 
standard deviation, or skewness, based on a sample of n members, y,. = 0(n~^). 
We shall not discuss the convergence of the series obtained later. But it is worth 
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noting that expansions in Hermitian polynomials are often satisfactory asymp- 
totic expansions, even if they diverge. 

Let df = f{x) dx be the distribution of x. Then it is known that we may write, 
symbohcally,* 


f{x) = exp 


L B\\dx) i\\dx) r! \dx) 






71 


Expanding in Hermitian polynomials H^(x) = , we have 




Ts: 


4- 


lOyf+y^ 


357172 + 75 , 


6 ! 


7! 


In the special case where 7, = 0{n~^) we have 


»+■■■]■ 




120 

7i72 

144 


H,{x). 


72 

7i 

1296 


H, 


,(a:) -I- 0(m-2)J . 


To find the mode, we put -^fW) = 0- Hence 


H^{x)-^H,{x) + '^^H,{x) + .., = Q. 

The terms needed to find the root of this equation with the smallest absolute 
value depend, of course, on the behaviour of the cumulants as r increases. If 
1 72 !> I 73 1) 6tc., are not substantially less than ] 7^ |, but all are small, so that 
powers and products can be neglected, then: 




7? 


8 48 576 


Where 7,, = Oin-^’’), we have 


Hence 




So in the first case the mode is 

76 


m + 




48 


+ ... + 


i-yrzr-. 


2'-r! 




_ K‘ ^3 I ^ ^7 I . (~)''^2r+r I 

^ 2 X 2 8x| 48x|'^'"'^ 2''r!/d2 


( 1 ) 


* For a comparison of this expansion with that of the Oharlier Type A distribution, see CramAr 
(1937). The symbolic relation has also been used recently by Cornish & Fisher (1937). 
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If = 0{n), then the mode is 


. V _ J I : - 

Ou. ^ a.^a 


S/CgACj ^ «| 




2k' 2 8<| 12«:| 4/fJ 

- a' I I f4 , n(n-^\ 

2/<2 8/t| I2f4 4/i| 


TX 

The median is the value of x for which j /(«) du = or, 

J —00 


Since 


the value for which 






So 


e»*‘ 




6 ' 24 

If 7r does not fall off systematically we have 

a: = _Ii+I-3_-yL + ... + , 

6 40 336 ■■■• 


When /c, = 0{n), we find 

X* 7^a 

I 6 


+ ^1 + 


n 5r!' 

8 


2 ’-(2 r+l)r! 

'y!\^|yi 7s. ^7i7i 35yf 
4 / 6 


40"^' 48 


432 


+ 0(w“*) = 0. 


Hence 




6 40 12 ' 324 

So if powers and products can be neglected the median is 


m + 

» 


\ 6^40 336^ 7 


- V '"a 4- 


^ 6/c.‘^40Aft 3364 


+ ... + 


{-Yk, 


8r+l 


2’'(2r+l)r!4 




And when = 0{n), the median is 




'^34 


+ ; 


174 


fiKji 404 124^3244 


• O(ra-a) , 


u[—^ +-A. + 7 . V 3 _ o(n-^) 
■ 6 /^ 3^404 124^3244 ^• 


(2) 


(3) 


(4) 
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Thus the distance from the mean to the mode is approximately thrice that 
from the mean to the median if y^, y^, etc., are small quantities (i.e. is nearly 3, 
etc.) and if yg, etc., are small compared with yj^. This is often the case for 
nearly normal distributions. Thus for the best known of Type III distributions, 
that of for n degrees of freedom, aTj = n, k^= 2n, .k, = !)!«,. Hence 

from equation (2) the mode is at n- 2 + (?(«,-*). In fact it is n- 2 exactly. For 


2 32 

the median, equation (4) gives m — 0(n-^). The fraction which Pearson 

0 Q 

empirically estimated at 0-6691 + 0-0094®-'^ is therefore — | — — + 0(v~^) 

3 406p 

Consider the distribution of the estimated mean, from a sample of n, 

taken from a distribution with arbitrary cumulants k^, k^, Xg, etc. The cumulants 
of the distribution of the mean are x^, Kjn, etc. Hence equations (2) and 
(4) hold, and the mean is x^, the mode is 


2nK. 




\8a 


5XgXj 


8x1 


124 


-b 


4 




and the median is 


+ 

( - 

X3X4 

677 Xg 

Uo/ci 

124 


+■ 




174 

324x|, 




The corresponding expressions in terms of the moments may be written down. 
The skewness is not exactly Ijn of the skewness of the original curve, but nearly 
so if n is large. 

Again consider the distribution of the estimated variance from a sample of 
n members from a normal distribution with unknown mean, and variance o'*. 


The cumulants of the distribution of the estimated variance 
are 


1 

n—l 


[Sx^,-n-'^{Ex,Y] 


So-® 


K-, = 


Kg = 


n-V 


Ka — 


(n-1)*’ 


48cr8 

(w-l)®’ 


Xk = 


3840 - 1 “ 

{n-lY’ 


etc. 


Thus we are dealing with a slightly transformed distribution, and the mode is 
20 ^^ ( tit 3 ^ ( 7 *^ 

0-2 or-^ — , exactly. From equation (4) the median is 

n—l n—l 


(T‘ 


32 


3(71- 1) 405(71-1)' 


,+ 0 ( 71 - 8 ) 


I , or o* 1 


377 


302 

40677* 


+ 0(77’ 


.-3) . . 


If, however, the distribution sampled is not normal, but has a finite Xg, etc., 
then the first three cumulants, in terms of those of the original distribution, are 


24 j 84 , 4(77- 2) X* , 12XaX^ , Xg 

'^*’ 77 - 1^77 ( 77 - 1 )*'^ 77 ( 77 - 1 )* ^ 77(77 - 1 )'^ 77 *' 

Fisher (1928) gives expressions for the next three cumulants, but these become 
very complex, x(2®) having 21 terms. 
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The mode and median of a distribution 


It follows that the mean is k^, the mode 


and the median 


K 


K 


■-[-T+ 


2{2/r| + /t:4) J 

4a:| + + 

6(2<| + ^4) J 


n~^ ■{■ 0{n~'^). 


The distribution of the estimate of is symmetrical for a symmetrical dis- 
tribution. In general its mean is k^, its mode 


'fa- ' 


46/c, + 


- 658/<| -t- 216^1^5 -HOS/c^ATs 


2(6/c| + + Qk^k^ + /Cg 




the median differing from the mean by ^ of the value given above. 

Finally the mean estimate of for a sample from a normal distribution is 
36 ( 7 ^ 

zero, its mode — + 0(n~^), and its median — VO{n~^). The general 

expressions for the mode and median can readily be calculated from Fisher’s 
(1928) equations. 

We now pass to some cases where Pearson’s trisection rule does not hold, 
even approximately. 

Pearson’s Type I and Type IV curves are asymmetrical, and have one more 
adjustable parameter than Type III. Thus y^, i.e. /Jg- 3, can vary independently 
of Yi, i.e. di- In consequence the curves may be nearly symmetrical, but far from 
normal. This is so if they approximate to Type II or Type VII, respectively. In 
this case the even measures of divergence y^r may be much larger than the odd 
measures y 2 r+i) which tend to zero with symmetry. Thus we cannot neglect 
higher cumulants, or products, as in equations 1-4. For Type IV and VII curves 
the higher moments are infinite, so no formulae of the given type are possible. 
For Types I and II a formula could be given, but direct calculation is clearly 
preferable. 

A simpler case is that of the scalene triangular distribution whose graph is 
obtained by joining ( - 6, 0), ^0, (a, 0). That is to say 

Here the mean is ^(a — 6), the mode 0, and the median, if a > 6, is 


a-^[^a{a + b)], 


or 


a~b (a~b)^ [a~b)^ 
4 32a 128o2 


That is to say, the median is one-quarter, not one-third, of the distance from the 
mean to the mode, if o — 6 is small compared with a. The rth moment about zero 
2K+i-(-6)-+^] 
r(r + l){a + b) 



If lc = 


a—b 
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6V[2fc(94-2fca)] 

7(3 + P)* 


etc. 


V(«6)- r. - 

If k is small, the odd y s are of order k, whilst the even ones approximate to those 
of the symmetrical distribution. Moreover y^ = so the formulae 1-4 

clearly do not hold. 

We see then that formulae (2) and (4) are quite useful in a special ease which is 
important in sampling theory, hut have no general vaUdity. 


SXJMMABY 

Expressions are obtained by which the distances of the mean and median of 
a skew distribution can be calculated from its moments or oumulants. The series 
obtained may or may not converge. They give satisfactory results for Type III 
distributions, and for the distributions of the mean, variance, and other cumu- 
lants as estimated from samples. 
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TABLE OF PERCENTAGE POINTS OF THE 
^-DISTRIBUTION 

Computed by MAXINE MEBBINGTON 


The following table has been derived from Miss Thompson’s Tables of Percentage Points of the 
Incomplete Beta Punction {Biomeirika, 32, 168-81), by taking 

for the case Vi = l. f is the usual “Student” ratio based on an estimate of variance having 1 /= vj 
degrees of freedom. If ip is the quantity tabled, P/100 is the chance that 1 1 ^ ij, i.e. represents 
the area in the two tails of the i-distribution. The table includes certain levels for t not previously 
available and should be accurate to the five significant figures shown. 


, 60 

25 

10 

6 

2-6 

1 

0-6 

l-OOOOO 

2-4142 

8-3138 

12-706 

26-452 

63-667 

127-32 

0'81660 

1-6036 

2-9200 

4-3027 

6-2063 

9-9248 

14-089 

0-76489 

1-4226 

2-3634 

3-1826 

4-1766 

6-8409 

7-4533 

0-74070 

1-3444 

2-1318 

2-7764 

3-4964 

4-6041 

6.-6976 

0-72669 

1-3009 

2-0160 

2-6706 

3-1634 

4-0321 

4-7733 

0-71766 

1-2733 

1-9432 

2-4469 

2-9687 

3-7074 
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0-69013 

1-1937 

1-7469 

2-1199 

2-4729 

2-9208 

3-2626 

0-68919 

1-1910 

1-7396 

2-1098 

2-4681 

2-8082 

3-2225 

0-68837 

1-1887 

1-7341 

2-1009 

2-4460 

2-8784 

3-1906 

0-68763 

1-1866 

1-7291 

2-0930 

2-4334 

2-8609 

3-1737 

0-68606 

1-1848 

1-7247 

2-0860 

2-4231 

2-8463 

3-1634 

0-68635 

M831 

1-7207 

2-0790 

2-4138 

2-8314 

3-1362 

0-68580 

1-1816 

1-7171 

2-07S9 

2-4066 

2-8188 

3-1188 

0-68531 

1-1802 

1-7139 

2-0887 

2-3979 

2-8073 

3-1040 

0-68486 

1-1789 


2-0639 

2-3910 

2-7969 

3-0906 

0-68443 

1-1777 

1-7081 

2-0698 

2-3846 

2-7874 

3-0782 

0-68406 

1-1766 


2-0656 

2-3788 

2-7787 

3-0669 

0-68870 

1-1767 


2-0618 

2-3734 

2-7707 

3-0666 

0-68335 

1-1748 


2-0484 

2-3686 

2-7633 

3-0469 

0-68304 

1-1739 

1-6991 

2-0462 

2-3638 

2-7664 

3-0380 

0-08276 

1-1731 

1-6973 

2-0423 

2-3696 

2-7600 

3-0298 

0-68066 


1-6839 

2-0211 

2-3280 

2-7046 

2-9712 

0-67862 

1-1616 


2-0003 

2-2991 

2-6603 

2-9146 

0-67666 

1-1569 

1-8577 

1-9799 

2-2699 

2-6174 

2-8699 

0-67449 


1-6449 

1-9600 

2-2414 

2-6768 

2-8070 


10 

11 

12 

13 

U 

16 

16 

17 

18 

19 

20 
21 
22 

23 

24 

26 

26 

27 

28 

29 

30 
40 
60 

120 

00 







THE PKOBABILITY INTEGRAL OF THE RANGE IN SAMPLES 
OF n OBSERVATIONS FROM A NORMAL POPULATION 

I. FOREWORD AND TABLES 
By E. S. PEARSON 


1. Scope of the main table 

Denote by x^, a, random sample of n observations, arranged in ascending order of 

magmtude, drawn from a normal or Ganssian population having for probability law 

where ji and cr are respectively the mean and standard deviation of the population. The 
range, sometimes described as the spread, in the sample is x„ - x^ and we shall write the ratio 
of the range to the population standard deviation as 


w = 


cr 


( 2 ) 


No simple expression exists for the probability law /„(w) of w, but Table 1 below gives 
for specific values W of «> computed values of the probability integral 

rw 

^n{W)- /„(w)dw (3) 

J 0 

This expression is the chance that the range in a sample of n observations is less than a 
given multiple of the population standard deviation. The table has been calculated for 
samples with n lying between 2 and 20 and for intervals of 0-06 of W. The values of the 
integral are given to 4-decimal place accuracy; linear interpolation is adequate except in 
the neighbourhood of the two quartiles in each column. The method of calculation is 
described below by Dr H. 0. Hartley in a separate section. 


2. Auxiliary table for special uses (Table 2) 

When dealing with samples containing only a small number of observations the range or 
spread may often be usefully employed as a measure of dispersion in place either of fcp 
standard (root mean square) deviation or the mean deviation. For example, an estimate of 
the population standard deviation may be obtained by multiplying the range in a single 
sample or the mean range in a number of samples by the factors a„ shown in Table 2. The 
accuracy of this form of estimation of cr compared with that of other methods has been 
discussed by Davies & Pearson (1934). 

In other circumstances it may be useful to plot in serial order on a control chart the 
values of range obtained from successive samples, e.g. when dealing with the control of 
quality in mass production. For this purpose it is necessary to know certain standard prob- 
ability levels for w which will serve as control limits*. Twelve of these levels expressed as 
percentage points and obtained by interpolation in the main Table 1, are shown in Table 2. 
They replace approximate limits published a few years ago (Pearson, 1932). It will be found, 
however, that except in the case n — 3t, the discrepancies between the two tables are all 
small. As an example, the table shows that if samples of 7 observations are randomly drawn 
from a population with a standard deviation or, then in the long run only 6 % of these should 
have a range greater than 4-17cr, while 96 % should satisfy the inequality 

1 '26(r < < 4'49cr 

Probability levels for samples- with n>12 have not been included as the use of range for 
control purposes in larger samples is of doubtful value. 

* For a discussion of the use of range in problems of industrial quality control see Reports 
issued by the British Standards Institution (Pearson, 1935, pp. 89-90; Budding & Jennett, 1942). 

t Correct values for the case »=3 were given by McKay & Peayson (1933). 
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Table I . Probabililij integral of the range W in normal samples of size n 


"X n 
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0-0000 
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•Olio 
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0'25 

O ' 1403 

0-0171 

0-0020 
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•0245 
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0-35 

■1955 

•0332 

•0053 
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■2227 
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■2497 

•0643 

■0111 

•0022 


•0001 




0'50 

0'2763 

0-0666 
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ic 

o 

6 
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0-0062 
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•3719 
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•0304 

•0157 
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■6602 

•3943 

•2248 

•1247 
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•0437 
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0-6690 

0-3971 

0-2706 
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0-1204 
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o 

6 

0 0336 

1-80 

•7969 
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■4197 

■2920 
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1-85 

•8092 

•6094 

■4423 

•3138 

•2193 

•1516 

•1039 


■0479 

1-90 

■8209 

•6290 

•4649 

•3361 

•2394 

•1086 

•1178 

■0818 

•0566 

1-95 

■8321 

■6480 

•4874 

■3687 

•2602 

•1867 

•1329 

•0940 

•0661 
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0-8427 

0-6665 

0-5096 

0-3816 

0-2816 

0-2066 

0-1489 
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00768 
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■6846 

•6317 

■4046 

•3036 
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•1661 
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•0886 
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•3260 

■2460 
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■8802 

-7349 
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■4739 

■3720 

•2893 

■2232 

•1712 


2-25 

0-8884 
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0-6163 

0-4969 

0-3955 

0-3118 

0-2440 

0-1899 

0-1470 

2-30 

■8901 

•7656 

■6363 

•6196 

•4190 

•3348 

•2666 

■2095 

•1646 

2-35 

■9034 

■7799 

■6568 

■5421 

•4427 

•3582 

■2878 


•1830 

2-40 

■9103 

•7937 

•6748 

■5643 

•4603 

•3820 

■3107 

■2614 

•2025 

2-45 

■9168 

■8069 

•6932 

•6861 

•4899 


■3341 

■2735 

Ha 

2-50 

0-9229 

O -8106 

0-7110 

0-6076 

0-5132 

0-4300 

0-3579 

0-2964 
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Table 1 (cent.). Probability integral of the range W in normal samples of size n 


n 

w \^ 

2 

3 

4 

5 

6 

7 

8 

9 

10 

2'50 

0-9229 

0-8196 

0-7110 

0-6075 

0-6132 

0-4300 

O ’3670 

0'2964 

0-2443 

3-55 

•9286 

•8316 

•7282 

•6283 

■5364 

•4541 

•3820 

•3198 

■2666 

260 

■9340 

•8429 

•7448 

•0487 

'6592 

■4782 

■4064 

•3437 

■2894 

3-65 

•9390 

■8537 

•7607 

•6686 

-6816 

•6022 

•4309 

•3680 

•3130 

3-70 

•9438 

•8040 

•7769 

■6877 

•6086 

•5250 

■4666 

■3927 

•8372 

2-75 

0-9482 

0-8737 

0-7906 

0-7063 

0-6262 

0-6494 

0-4801 

0-4175 

0'3617 

2-80 

•9623 

•8828 

■8045 

■7242 

•6461 

•6726 

•6044 

■4426 

•3867 

2-85 

•9561 

•8915 

■8177 

•7415 

•6666 

•5952 

•6286 

■4676 

•4119 

2-90 

•9597 

•8996 

■8304 

•7680 

•6863 

•6174 

•6626 

•4923 

•4372 

2-95 

•9630 

•9073 

•8424 

■7739 

•7056 

•6390 

•6760 

•6171 

•4625 
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0-9661 

0-9146 

0-8537 
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0-6601 

O ' 6991 
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0-4878 
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•9741 
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■8842 
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•7750 

■7194 
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•9763 
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•8429 
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•7377 
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■6360 
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3-25 

0-9784 

0-9439 

0-9016 

0-8646 

0-8063 

0’7663 

0-7066 

0-6669 

0-6090 

3-30 

•9804 

•9487 

•9096 

-8667 

•8194 

■7721 

•7248 

•6782 

•6329 

3-35 

•9822 

•9531 

•9168 

•8761 

■8327 

•7881 

•7432 

•6988 

•6653 
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-9838 

•9672 

■9287 

•8869 

•8464 

•8034 
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•7186 

•6769 

3-45 
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•9302 

•8961 

•8673 

•8179 

•7778 

•7376 
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0 0361 

0-9037 
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0-8316 

0-7930 

0-7668 

0-7180 

3 ' S 5 

•9879 

•9677 

•9417 

■9117 
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•8446 

•8091 

•7732 

•7373 
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■9891 

•9706 

•9468 

•9192 

•8889 

•8668 

•8236 

•7898 

■7668 

3'65 

•9901 

•9734 

•9616 

•9261 

•8981 

•8683 

•8372 

•8065 

•7736 

3-70 

•9911 

•9769 

•9669 

•9326 

•9067 

•8790 

•8601 

•8204 

•7902 

3'75 

0'9920 

0'9782 

0-9600 

0-9386 

0-9148 

0-8891 

0’8622 

0-8346 

0-8062 

380 

•9928 

•9803 

•9637 

•9441 

•9222 

■8986 

•8736 

•8477 

•8212 

3 < 8 S 

■9936 

•9822 

•9672 

•9493 

•9291 

•9073 

•8842 

•8602 

■8356 

3-90 

•9942 

•9839 

•9703 

■9640 

•9356 

■9165 

■8941 

•8718 

•8488 

3'95 

•9948 

•9866 

•9732 

■9683 

•9416 

•9230 

•9034 

•8827 

•8614 

400 

Q -9963 

0-9870 

0-9768 

0-9623 

0-9469 

0-9300 

0-9120 

0-8929 

0-8731 

4-05 

■9968 

■9883 

•9782 

•9660 

•9619 

•9366 

•9199 

■9024 

•8841 

4- 10 

•9063 

•9895 

■9804 

•9693 

■9666 

■0426 

•9273 

■9112 

•8943 

415 

•9967 

•9906 

•9824 

•9724 

•9608 

•9480 

•9341 

•9193 

•9038 

4-20 

•9970 

•9916 

•9842 

•9762 

•9647 

•9630 

-9404 

•9269 

•9126 

4-25 

0-9974 

0-9925 

0-9869 

0-9777 

0-9682 

0’9576 

0-9461 

0-9338 

0-9208 

4-30 

•9976 

■9933 

■9874 

•9800 

•9715 

•9619 

•9614 

■9402 

•9283 

4-35 

■9979 

•9941 

•9887 

•9821 

•9744 

•9667 

■9662 

•9460 

•9352 

4-40 

•9981 

•9947 

•9899 

•9840 

•9771 

•9692 

•9607 

■9614 

•9416 

4 ' 4 B 

•9984 

•9963 

•9910 

•9857 

•9795 

•9724 

•9647 

■9563 

•9474 

4-50 

0’9985 

0-9068 

0-9920 

0-9873 

0-9817 

0-9764 

0-9684 

0'9608 

0-9527 

4-55 

■9987 

•9963 

•9929 

•9887 

•9837 

•9780 

■9717 

•9649 

•9575 

4-60 

•9989 

•9967 

•9937 

•9899 

•9865 

•9804 

•9747 

•9686 

•9620 

4 ' 6 S 

■9990 

•9971 

•9944 

•9911 

•9871 

•9825 

•9775 

•9719 

•9660 

4-70 

■9991 

•0974 

•9961 

■9921 

•9885 

•9846 

•9799 

■9760 

■9696 

475 

09992 

0-9977 

0-9956 

0-9930 

0-9898 

0-9862 

0-9822 

0-9777 

0-9729 

4-80 

•9993 

•9980 

■9962 

■9938 

•9910 

•9878 

•9842 

■9802 

■9769 

4'85 

•9994 

•9983 

■9966 

•9946 

•9920 

•9892 

■9860 

•9824 

•9786 

4-90 

•9995 

•9986 

•9070 

•0962 

■9930 

•9904 

•9876 

•9844 

•9810 

4-95 

■9996 

•9987 

•9974 

•9968 

•9938 

•9916 

■9890 

•9862 

•9832 

5-00 

0-9996 

0-9988 

0'9977 

0-9963 

0-9946 

0-9926 

0-9903 

0-9878 

0'9861 
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Table 1 (cont.)- Probability integral of the ra‘tige W in normal samples of size n 


\ n 
TT \ 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

250 

0-2007 

0-1644 

0-1342 

0-1094 

0-0890 

0-0722 

0-0686 

0-0474 

0-0383 

0-0309 

2-55 

•2213 

•1833 

•1614 

•1247 

-1026 

•0842 

•0690 

•0566 

•0462 

•0377 

260 

•2429 

•2033 

•1697 

•1413 

•1174 

•0974 

•0807 

•0668 

•0562 

•0465 

2-65 

•2663 

■2243 

•1891 

■1591 

■1336 

■1120 

•0037 

•0783 

•0654 

■0546 

270 

-2885 

•2462 

•2096 

•1780 

•1609 

•1278 

•1080 

•0911 

•0768 

•0647 

2-75 

0-3124 

0-2690 

0-2311 

0-1981 

0-1696 

0-1449 

0-1236 

0-1063 

0-0896 

0-0761 

2-80 

•3368 

■2926 

•2536 

•2194 

-1894 

•1632 

•1406 

•1208 

•1037 

■0889 

2-85 

•3617 

•3169 

•2770 

•2416 

•2103 

•1829 

•1687 

•1376 

■1192 

•1031 

2-90 

•3870 

•3417 

■3011 

■2647 

•2324 

•2036 

■1782 

■1668 

■1360 

■1186 

2-95 

•4126 

•3670 

•3268 

•2887 

•2664 

•2265 

•1989 

•1762 

•1642 

•1366 

300 

0-4382 

0-3927 

0-3512 

0-3134 

0-2792 

0-2484 

0-2207 

0-1969 

0-1737 

0-1538 

305 

•4639 

•4186 

•3769 

•3387 

•3039 

•2723 

•2436 

•2178 

•1944 

•1734 

310 

•4896 

•4446 

•4029 

■3646 

•3292 

•2970 

•2675 

•2407 

•2164 

■1943 

315 

•6150 

•4706 

■4292 

•3907 

•3661 

•3224 

•2023 

•2647 

•2394 

•2164 

3-20 

•6401 

■4965 

•4666 

•4171 

•3814 

•3483 

•3177 

•2896 

•2636 

■2396 

3-25 

0-5649 

0-6222 

0-4817 

0-4437 

0-4081 

0-3748 

0-3438 

0-3161 

0-2885 

0-2638 

3-30 

•6893 

■5475 

•8078 

•4703 

•4348 

•4016 

•3704 

•3418 

■3142 

•2890 

3-35 

-6131 

•5726 

■6337 

■4967 

•4617 

•4286 

•3974 

•3681 

•3407 

•3160 

3-40 

•6363 

■5970 

•6692 

•5230 

•4886 

•4557 

•4246 

•3963 

•3677 

•3417 

3-45 

•6689 

•6209 

•6842 

•6489 

•6161 

■4827 

•4619 

•4227 

•3950 

•3689 

3-50 

0-6807 

0-6442 

0-6087 

0-6744 

0-6413 

0-6096 

0-4792 

0-4602 

0-4226 

0-3964 

3-55 

•7017 

•6668 

•6326 

•6994 

•5672 

•5362 

■5063 

•4777 

•4604 

•4242 

3-60 

•7220 

■6886 

■6668 

•6237 

•5926 

•6624 

•6332 

•6051 

•4781 

■4622 

3-65 

•7414 

•7096 

•6782 

•6474 

•6173 

•6881 

•5696 

•5321 

•6066 

•4801 

3-70 

•7600 

•7298 

•6998 

•6704 

•6414 

•6132 

•6866 

•5588 

•5329 

•6078 

3-75 

0-7776 

0-7491 

0-7206 

0-6926 

0-6848 

0-6376 

0-6110 

0-6850 

0-5598 

0 - S 352 

3-80 

•7944 

•7676 

•7406 

•7138 

•6873 

•6613 

•6357 

•6106 

•6861 

•5622 

3-85 

•8103 

■7860 

•7696 

•7342 

•7090 

•6841 

■6596 

•6366 

•6118 

•6887 

3-90 

•8264 

•8016 

•7777 

■7637 

-7298 

■7061 

■6827 

•6696 

■6360 

•6146 

3-95 

•8395 

•8173 

■7948 

•7723 

•7497 

•7273 

•7060 

•6829 

•6611 

•6397 

4-00 

0-8628 

0-8321 

0-8111 

0-7899 

0-7686 

0-7474 

0-7203 

0'7063 

0-6846 

0-6640 

4-05 

•8663 

•8460 

•8264 

•8068 

■7866 

•7666 

•7466 

■7268 

•7070 

■6874 

4-10 

•8769 

•8590 

•8408 

•8223 

•8030 

•7848 

■7660 

•7472 

•7286 

•7099 

4-15 

•8878 

•8712 

•8543 

■8371 

•8196 

•8021 

•7844 

•7667 

•7491 

•7316 

4-20 

•8978 

•8826 

•8669 

•8509 

■8347 

•8183 

•8018 

•7852 

•7886 

•7520 

4-25 

0-9072 

0-8931 

0'8787 

0-8639 

0-8488 

0-8336 

0-8182 

0-8027 

0-7871 

0-7716 

4-30 

•9159 

■9029 

•8896 

•8760 

•8620 

•8479 

■8336 

•8191 

■8046 

•7899 

4-35 

•9238 

•9120 

•8998 

•8872 

•8744 

•8613 

•8480 

•8346 

•8210 

■8073 

4-40 

•9312 

•9204 

•9092 

•8976 

•8858 

■8737 

•8614 

■8490 

•8364 

•8237 

4-45 

■9379 

•9281 

■9178 

•9073 

•8964 

•8863 

•8740 

■8626 

•8608 

•8391 

4-50 

0-9441 

09362 

0-9268 

0-9162 

0-0062 

0-8960 

0-8866 

0-8750 

0-8643 

0-8634 

4-55 

•9498 

•9417 

•9332 

•9244 

•9153 

•9060 

•8964 

•8867 

•8768 

•8667 

4-60 

■9660 

•9476 

•9399 

■9319 

•9236 

•9161 

•9064 

•8976 

•8884 

•8791 

4-65 

•9597 

•9630 

■9460 

•9388 

•9313 

•9236 

•9166 

•9074 

•8991 

•8906 

4-70 

■9640 

•9679 

•9616 

■9451 

•9383 

■9312 

•9240 

•9165 

■9090 

■9012 

4-75 

0-9678 

09624 

0-9667 

0-9508 

0-9446 

0-9383 

0-9317 

0-9249 

0-9180 

0-9110 

4-80 

•9713 

■9666 

•9614 

•9560 

•9605 

•9447 

■9387 

•9326 

•9264 

•9199 

4-85 

•9746 

■9702 

•9656 

•9608 

•9568 

•9606 

■9462 

•9396 

•9340 

•9281 

4-90 

•9774 

•9736 

•9694 

■9660 

•9605 

•9669 

■9510 

•9460 

•9409 

•9356 

4-95 

•9799 

■9766 

•9728 

•9689 

•9649 

•9607 

•9663 

•9618 

•9472 

•9424 

5-00 

0-9822 

0-9791 

0-9769 

0-9724 

0-9688 

0-9660 

O -9011 

0-9671 

0-9629 

0'9486 
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Table 1 (cent.). Probability integral of the range JV in normal samples of size n 


\ 71 

PT \ 

2 

3 

4 

5 

6 

7 

8 

9 

10 

600 

0-9996 

0^9988 

0^9977 

0-9963 

0-9946 

0-9926 

0-9903 

0-9878 

0-9851 

505 

•9996 

■9990 

•9980 

•9967 

•9962 

■9936 

•9916 

■9893 

■9869 

510 

■9997 

■9991 

■ 9982 

-9971 

•9968 

•9942 

■9925 

■9906 

•9884 

515 

■9997 

■9992 

•9985 

•9976 

•9963 

•9960 

•9934 

■9917 

■9898 

5'20 

■9998 

■9993 

•9986 

•9978 

•9968 

■9956 

■9942 

■9927 

■9911 

6-25 

0^9998 

O^9094 

0-9988 

0-9981 

0-9972 

0-9961 

0-9949 

0-9936 

0-9922 

530 

■9998 

■9996 

•9990 

•9983 

-9976 

■9966 

■9966 

■9944 

■9931 

5-35 

■9998 

■9996 

■9991 

•9986 

•9979 

•9971 

■9961 

■9961 

•0040 

5-40 

■9999 

■9996 

■9992 

•9987 

•9981 

•9974 

■9966 

■9967 

■9948 

5-45 

■9999 

•9997 

•9993 

•9989 

•9984 

■9978 

■9971 

■9963 

■9964 

6-50 

0^9999 

0^9997 

0-9994 

0-9991 

0-9986 

0-9981 

0-9976 

0-9968 

0-9960 

5-55 

■9999 

■9997 

■9996 

•9992 

■9988 

■9983 

■9978 

•9972 

•9965 

5-60 

■9999 

■9998 

•9996 

•9993 

■9989 

■9985 

■9981 

■9976 

•9970 

5-65 

•9099 

•9998 

•9996 

•9994 

•9991 

■9987 

■9983 

■9979 

■9974 

5-70 

0^9999 

■9998 

•9997 

•9996 

•9992 

•9989 

■9986 

■9982 

•9977 

5-75 

l^OOOO 

0^9999 

0-9997 

0-9996 

0-9993 

0-9991 

0-9988 

0-0984 

0-0981 

5-80 


■9999 

■9998 

•9996 

■9994 

■9992 

■9989 

■9986 

•9983 

6-85 


■9999 

■9998 

■9997 

■9996 

•9993 

■9991 

■9988 

■9986 

S-90 


■9999 

■9998 

■9997 

•9996 

■9994 

■9992 

■9990 

■0988 

5-95 


■9999 

■9998 

■9998 

•9996 

■9996 

■9993 

■9991 

■9989 

6-00 


0^9999 

0-9999 

0-9998 

0-9997 

0-9996 

0-9994 

0-9993 

0-9991 

6-05 


■9999 

■9999 

■9998 

•9997 

■9996 

■9996 

■9994 

■9992 

6-10 


0^9999 

■9999 

■9998 

•9998 

■9997 

■9996 

■9995 

■9993 

6-15 


l^OOOO 

■9999 

■9999 

■9998 

■9997 

■9996 

•9996 

■9994 

6-20 



•9999 

■9999 

■9998 

■9998 

■9997 

•9996 

■9996 

6-25 



0-9999 

0-9999 

0-9999 

0-9908 

0-9997 

0-9907 

0-9996 

6-30 



0-9999 

■9999 

■9999 

■9998 

■9098 

■9997 

•9996 

6-35 



1-0000 

■0999 

■9999 

■9999 

■9998 

■9998 

■9997 

6-40 




0-9999 

•9999 

■9999 

■9998 

■9998 

•9997 

6-45 




10000 

■9999 

-9999 

■9999 

■9998 

•9098 

650 





0-9999 

0-9999 

0-9999 

0-9999 

0-9998 

6-55 





0-9999 

■9999 

■9999 

•9999 

•9998 

6-60 





1-0000 

■9999 

■9999 

■9999 

•9999 

6-65 






0-9999 

■9999 

•9999 

•9999 

6-70 






1-0000 

0-9999 

•9999 

•9999 

6-75 







1-0000 

0-9999 

0-9999 

6-80 








0-9999 

■9999 

6-85 








1-0000 

■9999 

6-90 









0-9999 

6-95 









1-0000 

7-00 










7-05 










7-10 










7-16 










7-20 










7-25 
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Table 1 (cent.) . Probability integral of the range W in normal samples of size n 


\ n 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

S 00 

0-9822 

0-9791 

0-9769 

0-9724 

0-9688 

0-9660 

0-9611 

0'9571 

0-9629 

0-9486 

505 

■9843 

•9815 

•9786 

■9756 

•9723 

•9690 

•9666 

•9618 

•9681 

•9543 

5-10 

•9861 

•9837 

•9811 

•9784 

•9756 

•9725 

•9694 

•9661 

■0628 

■9593 

5'15 

•9878 

•9866 

•9833 

•9809 

•9783 

•9767 

•9729 

•9700 

•9670 

•9639 

5'20 

•9893 

•9874 

■9863 

■9832 

•9809 

•9786 

•9760 

•9736 

•9708 

•9681 

5-25 

0-9906 

0-9889 

0-9871 

0-9862 

0-9832 

0-9811 

0-9789 

0-9766 

0-9742 

0-9718 

5-30 

■9917 

■9903 

•9887 

•9870 

•9852 

•9833 

•9814 

•9794 

•9773 

•9761 

5-35 

•9928 

•9915 

•9901 

•9886 

■9870 

•9864 

•9836 

•9819 

•9800 

•9781 

5-40 

•9937 

•9926 

•9913 

■9900 

•9886 

•9872 

•9866 

■9841 

■9824 

•9807 

5-45 

■9946 

•9936 

•9924 

■9912 

•9900 

•9888 

•9874 

•9860 

•9846 

•9831 

5-50 

0-9962 

0-9943 

0-9934 

09924 

0-9913 

0-9902 

0-9890 

0-9878 

0-9866 

0-9862 

5-55 

•9958 

•9951 

■9942 

•9933 

•9924 

•9914 

•9904 

•9893 

■9882 

•9870 

S-60 

•9964 

•9967 

■9950 

•9943 

•9934 

•9926 

•9916 

•9907 

•9897 

•9887 

5'65 

•9969 

•9963 

•9966 

•9950 

•9943 

•9036 

•9927 

■9919 

•9910 

■9901 

5'70 

•9973 

•9968 

•9962 

•9966 

•9960 

■9944 

■9937 

•9029 

•9922 

• 9914 

5'75 

0-9976 

0'9972 

0-9967 

0'9962 

0-9957 

0-9961 

0-9945 

0-9939 

0-9932 

0-9926 

S-80 

•9980 

•9976 

•9972 

•9967 

•9963 

■9958 

•9952 

•9947 

•9941 

■9936 

5'85 

•9982 

•9979 

•9976 

•9972 

•9968 

•9963 

•9969 

■9064 

■9049 

•9944 

5-90 

•9986 

■9982 

•9979 

■9976 

•9972 

■9968 

•9964 

•9960 

•9966 

•9952 

S'95 

■9987 

•9986 

•9982 

•9979 

•9976 

•9973 

•9969 

•9966 

•9962 

■9968 

600 

0-9989 

0'9987 

0-9984 

0'9982 

0-9979 

0-9977 

0-9974 

0'9971 

0-9967 

0-9984 

605 

•9990 

•9989 

■9987 

•9984 

•9982 

•9980 

•9977 

•9976 

•9972 

•9969 

610 

•9992 

■9990 

•9989 

•9987 

•9986 

•9983 

•9981 

•9978 

•9976 

•9973 

615 

•9993 

•9992 

•9990 

•9989 

•9987 

•9986 

•9983 

•9981 

•9079 

•0977 

6'20 

•9994 

•9993 

•9992 

•9990 

•9989 

•9987 

•9986 

•9984 

•9982 

•9980 

6-25 

0-9996 

0-9994 

0-9993 

0^9992 

0-9991 

0-9989 

0-9988 

0-9986 

0-9986 

0>99S3 

6'30 

•9996 

•9996 

•9994 

•9993 

•9992 

•9991 

•9990 

•9988 

•9987 

■9986 

635 

•9996 

■9996 

•9996 

•9994 

•9993 

■9992 

•9991 

•9990 

•9989 

•9988 

6-40 

■9997 

•9996 

•9996 

•9995 

•9994 

•9993 

•9992 

•9992 

•9991 

>9990 

6-45 

•9997 

•9997 

•9996 

•9996 

•9996 

•9994 

•9994 

•9993 

■9992 

■9991 

6-50 

0-9998 

0-9997 

0-9997 

09996 

0-9996 

0-9995 

0-9996 

O-9004 

0-9993 

0-9993 

6-55 

■9998 

•9998 

•9997 

•9997 

•9996 

■9996 

•9995 

•9995 

•9994 

•9994 

660 

•9998 

•9998 

•9998 

•9997 

•9997 

■9997 

•0996 

■0990 

•9995 

•9996 

6-65 

•9999 

■9998 

•9998 

•9998 

•9997 

•9997 

•9997 

■9996 

■9996 

•9995 

6-70 

•9999 

•9999 

■9998 

9998 

•9998 

•9998 

•9997 

•9997 

■9997 

•9996 

6-75 

0-9999 

0-9999 

0-9999 

0-9999 

0-9998 

0-9998 

0-9998 

0-9997 

0-9997 

0-9997 

6-80 

■9999 

•9999 

•9999 

•9999 

•9998 

•9998 

•9998 

•9998 

■9998 

•9997 

685 

■9999 

•9999 

•9999 

•9999 

•9999 

•9999 

•9998 

•9998 

■9998 

•9998 

690 

0-9999 

•9999 

•9999 

•9999 

•9999 

•9999 

•9999 

•9908 

•9998 

■9998 

695 

1-0000 

0-9999 

•9999 

•9999 

•9999 

■9999 

•9999 

•9999 

•9999 

•9998 

700 
705 
7- 10 
7-15 
7-20 

7-25 


l-OOOO 

0-9999 

0- 9999 

1- 0000 

0-9999 

0- 9999 

1- 0000 

0-9999 

•9999 

0- 9999 

1- 0000 

0-9999 

•9999 

0- 9999 

1- 0000 

0-9999 

•9999 

•9990 

0- 9999 

1- OOOO 

0-9999 

•9999 

•9999 

0- 9999 

1- OOOO 

0-9999 

•9999 

•9999 

•9999 

0- 9999 

1- 0000 

0- 9999 
■9999 
•9999 
•9999 

0'9999 

1- OOOO 
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3. The origin of the present tables 

Tables giving the expected or mean value and the standard deviation of range in random 
samples from the normal population of equation (1) were calculated by L. H. C. Tippett 
(1926) in the Department of Applied Statistics, University College, London. Since the 
probability distribution /„( m) is itself far from normal in form, it tvas evident that its mean 
and standard deviation alone would not provide all the information generally needed in 
practice. Tippett included in his paper some values of the constants and /?a of the dis- 
tribution and his- work was extended by the present -writer (Pearson, 1926, 1932) who 

Table 2 


Size 

of 

sample' 

n 

Factor 

o« 

Lower percentage points 

Upper percentage points 

0-1 

0-6 

1-0 

2-5 

6-0 

10-0 

10-0 

6-0 

2-6 

1-0 

0-6 

0-1 

2 

0-8862 

0-00 

O-Ol 

0-02 

0-04 


IPI’ 

2-33 

2-77 

3-17 

3-64 

3-97 

4-66 

3 

0-6908 

0-00 

0-13 

0-19 

0-30 




3-31 

3-68 

4-12 

4-42 

5-06 

4 

0-4867 

0-20 

0-34 

0-43 

0-59 

0-76 

0-98 

3-24 

3-63 

3-98 

■Bin 

4-69 

5-31 

5 

0-4299 

0-37 

0-56 

0-68 

0-86 


1-26 

3-48 

3-86 

4-20 

4-60 

4-89 

5-48 

6 

0-3946 

0-64 

0-76 

0-87 

1-06 

1-26 

1-49 

3-66 


4-36 


6-03 

6-62 

7 

0-3098 

0-69 

0-92 

1-06 

1-25 

1-44 

1-68 

3-81 

4-17 

4-49 

4-88 

6-16 

6-73 

8 

0-3612 

0-83 

1-08 

1-20 

1-41 

■PM 

1-88 

IKSEI 

4-29 

4-61 


6-26 

6-82 

9 

0-3367 

.0-98 

1-21 

1-34 

1-66 

1-74 

1-97 

4-04 

4-39 



5-34 

6-90 

mm 

0-3249 

1-08 

1-33 

1-47 

1-67 

1-86 

2-09 

4-13 

4-47 

4-70 

6-16 

6-42 

6-97 

11 

0-3162 

1-20 

1-46 

1-68 

1-78 

1-97 

2-20 

4-21 

4-55 

4-86 

6-23 

6-49 


12 

0-3069 

1-30 

1-65 

1-68 

1-88 

2-07 

2-30 

4-29 

4-62 

4-92 

6-29 

6-54 

■ 


Estimate of (r=a„ x range (or mean range) in a sample of n observations. 


developed an approximate method of determining probability levels for w and provided 
some provisional tables of these. The need haa, however, been felt for some time for a full 
and accurate table of the probability integral of the range toiit into place among other funda- 
mental tables associated with the normal distribution. The completion of this objective has 
been made possible by a grant from the Department of Scientific and Industrial Research, 
whose assistance in the matter is acknowledged with warm appreciation. The actual method 
of computation was planned by Dr H. 0. Hartley and the calculations were carried out under 
his supervision by Scientific Computing Service, Ltd. The scope of the main table was 
limited to 20. As n increases beyond this value there is an increasing risk that the table 
may be misleading in practice, since /„(w) becomes very sensitive to relatively slight 
departures from normality in the tails of the population distribution. 
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11. NUMERICAL EVALUATION OF THE PROBABILITY INTEGRAL 

By H. 0. HARTLEY 


The formula used for the tabulation of the probability integral P«( W) of the range in normal 
samples of size n is given in the paper printed on pp. 334-48 below, where it proved that 

\» ^00 f \n-l 

where z{x) =; (2jT)~i e-*®*. 

Certain properties of this formula and the facilities provided by certain modem calculating 
machines make this integral amenable to tabulation. 

The main work consists in the evaluation of the integral 

J a> / [^ \ n-l 

«(«)( z{x)dx\ du, (2) 

iW \J u-W ] 

by quadrature for a two variable' network of values o£n and W. The range of integration is 
from iW up to a point where the integrand 


z{u) 


J u- 


z{x)dx 

w / 


1“ 


(S) 


vanishes to 7-deoimal accuracy.* For each pointof the network n, W, therefore, the integrand 
(3) was tabulated for a set of equidistant values of u covering the range of integration. The 
interval of integration was chosen as Au = 0-2 throughout. This was sufficient to obtain 
about 6-decimal accuracy in the integral (2). 

The interval in W was taken as wide as possible but sufficiently fine to permit checking 
by differencing and the subsequent subtabulation of Pn(W) to interval 0'06, which is the 
interval in the final table. An interval of AW = 0-26 was therefore chosen for the n, TF 
network. 

For small values of W it was necessary to tabulate the integrand for all integers n for 
which P„{W) is required in the final table. For larger values of n and W, however, it was 
sufficient to calculate the final integral (1) for odd n and to obtain intermediate values by 
interpolation. Below, then, is shown the two variable network for which the integral (2) 
was produced by quadrature : 


W = 0-00 (0-26) 1-26 
IF = 1-60 (0-26) 2-76 
W = 3-00 (0-26) 3-26 
W = 3-60 (0-26) 8-00 


and n = 3 (1) 20. 

ri=3(l) 9(2)23. 
w=3(l) 6(2)23. 
n = 3 (2) 23. 


( 4 ) 


For n = 2 the final integral PjflF) is given directly by the normal integral and may be 
obtained by interpolation in Table II of Tables for Statisticians and Biometricians, Part I. 
Using the notation of that table (Sheppard’s original notation) we have 




=“(5)' 


Moreover, for purposes of interpolation, use was made of the formal relation 

Pi(TF)=l for TF>0. 

For fixed u and W and for values of n in the arithmetic progression (4), the integrand (3) 
is a geometric progression with / ^ \ a 

z(u} I J ?(») dxj 

* The integrand was calculated to 7-deoimaI accuracy in order to obtain Pn(lF) to about 
6-decinial accuracy. 
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as 


Probability integral of range 

leading term and (f or (f z{x)d^ 

\Ju~iv 7 Vu-TT / 


aa common ratio. This leading term as well as the common ratios were easily obtained from 
Table II of Tables for Staiistioians and Biametrioians, Part I and the terms of the progression 
were then automatically produced on a Mercedes calculating machine Model 38 M.S. and 
copied down in two-way tables with u as row heading, n as column heading and W as 
table heading. The values of the integrand were then checked by differencing column-wise 
and added to yield the main term of the integral (2). The correction terms which, according 
to Gregory’s formula, convert the integrand-sum into the integral were calculated from the 
differences and checked by the application of Gauss’ formula of integration. Finally, to 
obtain P„(If) the term , „ 

was produced by continued multiplication on the Mercedes and added to the corresponding 
integral (2) to yield P„( W) for all points of the above network. 

For odd values of n the integral P„(W) was, then differenced IT-wise on the National 
machine which, incidentally, produced column totals hPniW) for these values of n. Two 

w 

checks were applied at this stage. One consisted in inspecting the fourth order differences. 
As a second check, the mean range, , was calculated from the formula 


8 . 




{w)dw,* 


and compared with the eorraot mean range given in Table XXII of Tables for Statisticians 
and Biometrioians, Part II. Finally, the function PJi,W) was subtabulated to interval 0-06 
on the National machine by a method similar to that described in detail by L. J. Comrio 
(1938). 

The values of Pn{W) for even n were then obtained by interpolation with the help of 
two interpolation formulae of Lagrangian type: 

2048P„(If) = -6[P„_,(lf) + P„+,(Tr)H49[P„_5(lF) + P„«(W^)] 

-246[P„_3{IF)-i-P„+a(If)] + 1226[P„_7}F)-fP„+i(IF)], (6) 

20P„(TF) = Pn-3(^F) + Pn+B(TF)-6[P„_3(IF)+P„+3(1F)] 

+ 16[P„_i(IF)+P„+i(If)]. (6) 

Formula (5) yields the interpolate for even n from the given values of P„( W) at adjacent odd 
values of n. This formula was used throughout. In some oases, however, the resulting inter- 
polate was accurate to about 3 places of decimals only. In such oases values of Pn_s, Pn+a> 
Pn-v Pn+i accurate to 6 places of decimals and values of P„_j, Pn+a accurate to (say) 
3 places of decimals were substituted in formula (6). This yielded a ‘corrected value’ of 
P„(1F). The process was then repeated for n = n+ 2 and so on imtil all values of P„(W) 
had ‘settled down’ for even values of n. It is easy to see that the process is convergent and 
that the maximum error in the interpolate is 2 units for the 6th decimal. 

After completion of the interpolation n-wise, the interpolates Pn{W) for even n were 
differenced TF-wise, cheeked and subtabulated as for odd values of n. 

* This is true provided P„(8) = l to 6-deoimal place accuracy. 
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NOTES ON TESTING STATISTICAL HYPOTHESES 
By E. S. PEARSON 


1. In July 1939, a few weeks before the opening of the present war, a Con- 
ference on the Application of the Calculus of Probabilities was held at Geneva 
under the auspices of the International Institute of Intellectual Co-operation 
(League of Nations). At the public session at which a paper by Prof. J. Neyman 
was presented and also subsequently in some informal discussions, a number 
of questions were raised: 

(a ) In choosing a test for a statistical hypothesis, is it possible or even necessary 
to specify the hypotheses alternative to that tested? Why should not a test be 
made to depend only on the form of law associated with the hypothesis tested? 
For example, Newton’s hypothesis of gravitation was formulated and tested 
without any need to define alternative laws. 

(h) Is the method of approach to these problems advocated by Prof. Neyman 
and myself applicable to testing the appropriateness of probability laws or only 
to testing hypotheses regarding the numerical values of constants contained in 
these laws? 

After the conclusion of the conference, I set down some Notes for a few of the 
statisticians who had taken part in the discussions, hoping that at leisure they 
might feel stimulated to define their views on the subject more precisely. But 
almost before the Notes were despatched, war in Europe had intervened. The 
only reply which I received was from Prof. Gumbel, and this, after some un- 
avoidable delay, has now taken shape in the contribution printed on pp. 317-33 
below. In publishing this, it seems useful to add my own Notes, which are given 
with only minor verbal alterations in the following pages. They are in part a 
restatement with rather different emphasis of views expressed in a paper published 
four years ago (Pearson, 1938). 

2. With regard bo one of the points raised under (a) above, it should be 
remembered that a statistical hypothesis as defined by Neyman and myself is 
a hypothesis concerning the probability law of random variables. The gravi- 
tational hypothesis of Newton is not a statistical hypothesis in the sense defined; 
statistical methods may be introduced to test the Newtonian hypothesis, how- 
ever, and they will involve tests of statistical hypotheses or ‘significance tests’ 
because it will be assumed that errors of observation exist which may be regarded 
as random variables, probably taken to follow the normal distribution law. 

For example, on the Newtonian hypothesis, the angular co-ordinates of a 
planet measured from the earth as origin may at certain moments be given as 
1 = ^i,Ti = 1,2,..,). If we have a number of observations of position y^, 
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subject to observational error, the statistical problem will be to test whether 
these are consistent with the hypothetical position values or whether they 
suggest that y have some other different values at the moments of observation. 
Thus the ‘alternatives’ that we have immediately in mind will be alternative 
values for rj, not alternative gravitational hypotheses. If, however, some alter- 
native law of motion were proposed, so that we could specify definite values 

111 alternative to the values yt of tbe Newtonian law, then undoubtedly we 
could choose a statistical test which would be particularly efficient in discrimi- 
nating between the two alternatives. Such a course became possible when the 
Einstein hypothesis was formulated and the orbit of Mercury considered. But 
the absence of an alternative gravitational law does not prevent us selecting a 
statistical test which will be (a) sensitive to departures in g, y from gj, yf, but 
(6) relatively insensitive to departures from normality in the distribution of 
errors. We should make this selection because, if the Newtonianlaw were incorrect, 
we believe that this would result in a change in y^ but not in a departure of the 
distribution of observational errors from the normal law. 

This example, of course, concerns a statistical hypothesis regarding the values 
of two parameters y, not regarding the form of a probability law of random 
variables. The following general approach shows, however, that the principles 
discussed may be applied to testing hypotheses regarding probability laws. 

3. Suppose that a: is a continuous random variable and that is a statistical 
hypothesis which assumes that the elementary probability law for x is p(x | 
in the interval — oo to -f oo. Thus 

f p(x\Ho)dx^l. (1) 

J —00 

Now write y = \ i Hg) dx. (2) 

J —oO 

y will be a non-decreasing function of x having values confined to the interval 
(0, 1). Further, the elementary probability law of y will be 

p(y)=p{x)l^==l for 0^2/<l, (3) 

or all values of y between 0 and 1 are ‘equally probable’. 

Suppose now that we wish to use a set of n independent values x^, x^, 
to test that the probability law is of the assumed form p{x j Hq). It is clear that 
the hypothesis Hq is exactly equivalent to the hypothesis, say hg, that the n 
values y^, ..., (obtained from the a;’s by the transformation (2)) have been 
sampled subject to the probability law (3). Just as the point {x^, x^, x„) may 
be represented in an unlimited Ti-dimensioned space having probability density 

p(x^, »2, I Ho) = n I Ho)}, (4) 
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if Hq is true, so the point (j/j, yi , may be placed in an n-dimensioned hyper- 
cube with sides of unit length and with uniform probability density, if Hg and 
therefore is true. It follows that if is what has been termed a ‘ simple hypo- 
thesis ’, i.e. specifies the form of p{x \ H^) completely,* then the test of may 
always be transformed to the test of h^. If then it were correct to say that the test 
of a statistical hypothesis depeyids only on the form of the law specified by it 
follows that for the type of situation considered the testing of a statistical hypo- 
thesis could always be reduced to the following simple problem: 

To test whether a sample of n independent random variables y-y, y^, y,^ 

(0 < < 1) has been selected from the so-oalied rectangular distribution, i.e. the 

distribution for which p{y) = 1, (0 < y < 1). 

4. We are at once faced, therefore, with the question of how to test this 
simple but apparently fundamental hypothesis. If is true, the sample point 
is equally likely to fall at any point within the w-dimensioned hyperoube. Thus 
in picking out the critical (or rejection) region in this space we can get no assist- 
ance whatsoever from the changes in probability density, as we might do in 
theai'Space. If we wish to use a level of significance of a (say a = O-OlJ'forrejectihg 
^ 0 , it is clear that an infinite number of critical regions satisfying this condition 
are available; it is only necessary to select a region whose content is «. 

If we consider the n values of y and plot them in the interval (0, 1) as follows, 
• •• • •••• 0 0 
1 — I 1 1 1 r — —I 1 1 r— — I — I 

Fig. 1. 

the great majority of samples, from a rectangular distribution, at any rate if n 
is not too small, will be spread out fairly uniformly throughout the interval. 
Perhaps an ‘ideal’ sample against which to measure kregularities might be 
described as one for which the values of y fell at 

JL A A 2n,-l 

2«’ 2n’ 2?i’ 2n 

But what form of departure from this ideal of uniformity are we to pick out as 
suggesting that the hypothesis 7% is disproved? Should we judge significance by 
paying attention to the value of the mean y, of the variance, of the range of 
variation or of higher moments? Or should we use the ot tests? It seems 
difficult to find any basis for choice which could be regarded in any sense as the 
‘ best ’ . For any set of values 2 / 1 , ya, . . . , y„ some critical region of size a can always 
be found which will contain the sample point and therefore lead to the rejection 
of hfy. Indeed, the task of selecting a unique region on any rational basis would 
seem to be insoluble. 

* This condition is important. If the values of certain constants contained in the probability 
law need to be estimated from the observations, then the n values of y will not form a true random 
sample from a rectangular distribution. They will be subject to certain limitations to their degrees 
of freedom, though these may be relatively unimportant if n is large. 
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5. Directly it is recognized, however, that the choice of a test of a statistical 
hypothesis depends on something more than the form of the law associated with 
that hypothesis, it can be seen how a solution may he obtained.^ If we can specify 
a single alternative to Hq or a class of alternatives 0{H), then we shall have 
also an alternative h, or a class 0{h) to Thus, if 1 denotes a probability 
law alternative to \ H^), then for y the alternative is 

tor (S) 

where /(?/) means the solution of 

2 / ( 6 ) 


with regard to x. For example, Fig. 2 shows three typical forms of alternative 
piyjki), ptylh-J and A3) associated with alternatives p(xjlfi), ..., etc., to 
p(xjHo). 



Solid curve represents p(a!(j/p) 


piy\K) 

/ \ 


' p(y\i^2\l 

\ . / 


f 

p(y\h3) 1 
> \ / 

7 ^ 

/ ^ 

' 

0 



( 

/ \ ! 

1 

/ 

/ 

1 

1 


Solid rectangle represents p(yjhp) 
Fig. 2. 


We can now see the kind of test which will be most efficient for testing Bq 
with regard to possible classes of alternatives. If the alternative laws are of 
smaller dispersion (asp(a; j Bi)), we must be on the look-out for too many values 
of y near ^ and too few near 0 and 1. For alternatives with greater dispersion 
(a,sp(x \ Hj)), we must reject flg when there are too many y’s near, 0 or 1 and too 
few near |. While if the alternatives are likely to be asymmetrical curves (as 
I Afs)), then a different rule will be needed, as suggested by thep(2/ 1 A3) curve. 
6. It follows that in so far as it is possible to formulate the class of admissible 
probability laws p(x | H), the problem of selecting the most efficient test of Hq 
reduces to that of choosing a critical region in the 7i-dimensioned hypercube 
which is moat effective in detecting, from a sample of n values of y, differences 
between the rectangle p{y | Ap) and the appropriate alternative forms p{y | A). 
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If El is a single admissible alternative, then it has been shown (Neyman & 
Pearson, 1933, p. 298) that the region of content a in the hyperoube, within 
which n 

np(«/ilfeo) 

X— 1 j 

-11 <fc, - (7) 

n pii/iih) 

1=1 

or, in view of equation (3), n^p(yi i Ai)>p (8) 

where * is chosen so that P{(^i,y 2 , eWfl|fej} = a (9) 

has the following property. 

Of all regions of content a, Wg is more likely than any other to include the 
sample point when and not hg, is true. The region has been termed the best 
critical region for testing hg with regard to the alternative hi. 

As soon as Bg and Hi are specified, clearly | hi) and therefore the region 
Wg can be found, although mathematically it may be rather difficult to determine 

n 

the appropriate boundary H pIj/i I ^i) = constant, so as to satisfy (8). Since this 

1=1 

product is the probability density in the hypercube given by hi, it will be seen 
that what we set out to do is to include in the critical region those parts of the 
sample space where the density for hi is highest. It is here, on repeated sampling, 
that sample points would tend to be concentrated if hi is true, instead of being 
uniformly distributed as under hg. 

7. If instead of a single alternative hi, there is a class of admissible alter- 
natives 0(h), there may or may not be common points of concentration that can 
be included in the critical region. This wiU depend on whether the inequality 
(8) above defines a region independent of the particular hypothesis h of the class 
C{h). Even if there is no single region of content a which is exactly a ‘best, 
critical region’ for hg with regard to afi members of C{h), the general principle 
may still be used as a guide. We build up a critical region out of those parts of 
the hypercube where the probability density tends to be concentrated when the 
probability law departs from p{x | Hg) in the direction of the alternatives included 
in G{H). 

Eor example, in my earlier paper (Pearson, 1938) I suggested as appropriate 
in the following situation a test which, while not based on a common best critical 
region, was selected so as to include regions of greatest density associated with 
alternatives of G{h). Eor the hypothesis tested. 

The alternatives are asymmetrical curves with the same mean and standard 
deviation as (10). A typical alternative would be the Type III curve 

A 1 A* 

p{x 1 E) = c(l + e VA , (11) 


Biometrika xxxii 
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whose foxna. departs more and more from (10) as increases from zero, but the 
class need not be defined as precisely as this. In this problem it appears that 
if n independent observations ajj, ce^, available, the following is a good 

test of flj,- Take as test function n 


V 

<2 = 

(12) 


iesX 


where 

y; = 5(0-2 -y,) for 0<yi<0-2, 1 
3/;-i(2/.-0-2) for 0-2<y,<0-8, 
2 /i = 5 (l- 2 /i) for J 

(13) 


r* 1 


and 


(U) 


If Ha is true it may be shown that — 2 log^, Q is distributed as 
degrees of freedom. Hence any desired significance level a, for Q, may be found. 
We should then reject when Q is significantly small. 

A more systematic method of dealing with such problems has been con- 
sidered by Neyman (1937) in his paper on ‘smooth tests’. 

8. To sum up, the position seems to be this. It has often been argued that a 
statistical teat need only depend on the form of the probability law associated 
with the hypothesis tested. In the case where Hq concerns the probability law 
of a single random variable and where p{z ) Hq) is precisely specified, by the trans- 
formation from a; to y it has been shown that the problem of testing on the 
basis of n independent values of x can always be reduced to another problem, 
which involves this question. Can we regard a sample i/^, as having 

been drawn from the rectangular distribution p(y\ha) = 1, where We 

are faced with a single fundamental question and we have to consider whether 
it can be answered in a rational manner, unless we are prepared to take into account 
the kind of departures from the rectangular law that we either believe possible 
or at any rate consider it most important to be on the look out for. 

The transformation from x to y seems to have the advantage that it con- 
centrates attention on the main point at issue. That is my reason for emphasizing 
it in these Notes. Most of us have many preconceived ideas about appropriate 
tests if the probability law is taken in the form of p{x | Ha); we are accustomed to 
use the mean, the standard deviation, certain functions of moments, the y® test, .... 
But we are not so accustomed to test whether a sample comes from a rectangular 
distribution and we are therefore forced or, indeed, more willing to reconsider 
from first principles what course we should follow and why. 
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In dealing with the problem of testing statistical hypotheses J. Neyman (1937) 
and E. S. Pearson (1938) have considered the use of the probability integral 
transformation, which leads to a theoretical uniform distribution. This method 
presupposes that the usual comparison between theory and observations has 
already been applied. We shall first improve this comparison by introducing 
control curves. Then we shall apply to the uniform distribution the usual methods 
and the control curves. This will lead to simple tests for given statistical hypo- 
theses. 


1. Control citeves and the probability integral transformation 

Let a: be a continuous random variable for which n observations have been 
made. Let be the observed values arranged in increasing order of magnitude, 
m (1,2, ...,n) being the serial number. The simplest way of representing the 
observations is to plot the cumulative histogram x,n> tR- The relative number 
(a:„) of observations less than or equal to x„, is given by 

m = 7iW®>(a;„). 

The consecutive differences 

{l<m) 

constitute the observed distribution. Many statisticians present, instead of the 
original observations x„, , only the number of cases within certain arbitrary classes . 
From the practical standpoint this means a simplification, from the theoretical 
standpoint a complication. We shall suppose that all a;„ are known. 

The choice of a probability density to be applied to the observations con- 
stitutes the hypothesis. The probabihty density w (x,Cy,C 2 , ...), where Cj, Cg, ... 
are the constants, is called the theoretical distribution. For sake of simplicity it 
is assumed that aU observed values x„, have the same theoretical distribution. 
The probability W (x, c^, c.^, . . . ) of a value equal to or less than x is given by 

W(x,c^,C2,...) w(z,Ci,Ci,...)dz, a) 


21-2 
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where z is the variable of integration. It is customary to compare the observations 
m with the cumulative frequency curve x, nW{x). This comparison can be 
improved in the following way: The mth observation is a statistical variable 
distributed according to 







( 2 ) 


In a previous article (Gumbel, 1936) it has been shown that, for an ordinary 
unlimited distribution, with large n and with m of the. size ^n, the distribution, 
(2) converges towards a normal distribution with a mean given by W{x) = mjn 
and a standard deviation 


1 i W(x)(l~W{x)) 

w{x) sj n 


( 3 ) 


This formula does not contain m explicitly. Since each theoretical value x can 
be interpreted as an mth value, (3) gives its standard deviation. The interval 
X + O' will be called the control interval. Under the above condition, the probability 
that an mth value will fall within the control interval is about f . The two curves 
obtained by plotting a; ? <r, nW{x) will be called control curves. 

For a given initial distribution we shall have to find the mean and the standard 
deviation of the mth value which may diflPer from those of the general solution, 
especially if w is small. The control interval will be a certain function of w{x). 
Also the probability associated with the interval x^cr may differ from that of the 
general solution and may depend upon x. For the exponential distribution 
(G-umbel, 1937) the precision diminishes with increasing values of the variable. 
Below we shall apply this control to the uniform distribution w[x) = constant. 

The calculation of the probability W{x) and the control curves can often he 
. simplified by an indirect method. For certain, but not all, distributions it is 
possible to eliminate the constants by introducing a new variable y as a function 
of X, where 


“:=/(y,Ci,Ca, ...). 


(4) 


Accordingly, the probability that a value of the transformed variable will fall 
in the interval y to y+dy is 

w{x) dx = wifiy)] I f'{y) | dy. 

We call P(y) = w[f(y)]\f{y)\, 


or 


p(y) = w(x) 


dx 

dy 


( 5 ) 


the distribution of y. The probability V(y) of a value equal to or less than y is 


F(y) = W{a:,ci,c„...). (6) 

The transformation (4) is chosen such that the expression V(y) does not contain 
any constants which depend upon the observations. Therefore V(y) can be 
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calcdlated once and for all as a function of y. Such tables have been calculated 
for the distributions for which this reduction is possible. 

In order to compare the cumulative histogram m with the cumulative 
frequency x, nW{x), and to use the control curves, it is first necessary to compute 
the constants c^, c^, .... If the method of moments is used, the area between the 
cumulative histogram and the horizontal line If = 1, the arithmetic mean, is 
conserved. The value of the variable x which corresponds to a selected, numerical 
value of W(x) is obtained from the transformation (4). 

A special case of the transformation (4) which leads to a reduced distribution 
of astounding simplicity is the probability integral transformation due to Karl 
Pearson (1933) who introduced for y the probability function W{x). Since 

da; _ 1 

we obtain from (6) p(lf)sl. (g) 


This identity means that the distribution of the probability is constant. As 
p(lf ) is the probability density of a probability, it is difficult to establish its 
philosophical meaning. But formally the construction is valid and the corre- 
sponding value can be observed. It is our purpose to give .several methods of 
judging the significance of the differences between the theoretical distribution (8) 
and the corresponding ‘observed’ distribution. 

The word ‘observation’ wiU be given a special meaning. A certain theory 
which involves the choice of certain constants c^, c^, ... applied to the observa- 
tions, leads to the values 


= lf(a;,„,Ci,Ca, ... l^o) (to = 1, 2, 

corresponding to x^. These values, contained in the interval 0, 1, are the ‘^obser- 
vations ’. To any other set of constants c^, Cj, ... will correspond other ‘observed’ 
values 


W',^W{x^A,c^,. 


I ^o)- 


Therefore any test applied to the ‘observations’ of formula (8) might be used to 
judge the choice of the constants. To another hypothesisAi containing the con- 
stants dj, dg, ..., will correspond another set of ‘observed’ points 

Wi = If (a;„(, dj, dg, ... | 

The same observations when interpreted by different theories or different 
constants lead to different ‘ observations ’ . An incorrect theory involving properly 
chosen constants might give better results than a correct theory involving im- 
properly chosen constants. Therefore, to compare different theories, the constants 
for each must be determined with the same precision. In practice this condition 
will not always be fulfilled. For the precision of the characteristics depends upon 
the distribution for which they are calculated (Gumbel, 1936). Therefore, the 
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same method of determining the constants might lead to dilfea’ent degrees of 
precision for different distributions, whereas different determinations of the 
constants might lead to approximately the same precision. 

There is another point for caution: all tests derived from the probability 
integral transformation apply to the analytic form of the hypotheses and at the 
same time to the choice of the constants. A formula containing many constants 
may reproduce the observations more closely than a formula containing few 
constants, even though the constants in the first hypothesis have no meaning. 
Therefore we must limit onr comparison to hypotheses containing the same 
number of constants. For any set of statistical observations in the ordinary sense, 
there will usually correspond a small number of tenable hypotheses. We shall 
suppose it is known what they are. For we do not try to find a formula for the 
sake of doing it, but to explain the observed facts. We will not go so far as Neyman 
(1937), who formulated all possible alternfl-tives by a series of orthogonal 
functions. 

In theory the points representing are distributed uniformly in the 

interval 0, 1. This is true for any hypothesis, .provided the variable is continuous. 
But in practice this will never occur. The ‘observed’ points corresponding to any 
given hypothesis will differ from the theoretical set, even if the hypothesis is a 
very good one. The differences between the ‘observed’ set of points resulting 
from ft-u K ^'Hd the theoretical set allow the construction of tests which 
can be used to judge which of two given hypotheses is the better. But no 
statistical method gives an answer to the question whether or not a hypothesis 
is true. 

After a hypothesis has been selected, the preliminary steps which have to be 
made before it is possible to use the probability integral transformation, are: 
first, the determination of the constants; secondly, the calculation of probabilities 
W (x) for the values of x given by the transformation (4) ; and thirdly, the calcula- 
tion of the probabilities Wix^) of the observed values. It is only after these three 
Operations have been carried out that we obtain the ‘ observations ’ which are 
to he compared with the theory (8). Therefore, any test based on the probability 
integral transformation presupposes the usual comparison of the observed 
cumulative histogram with the frequency curve. In many cases this comparison, 
checked by the control curves, will indicate a clear superiority of one of the 
theories. If this is true, there is no necessity for a new test. 

It would be interesting to investigate the best criterion for judging the 
significance of the differences between the ‘observed’ and the uniform distribu- 
tion (8). But for practical purposes it is sufficient to know whether the differences 
for hg are smaller or larger than for First we shall establish rough measures of 
comparison; afterwards, more refined ones. 
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2 . Classical tests applied to a xjnieorm DiSTRiBUTiofi 

The comparison of an observed distribution of a continuous variable "with the 
theoretical distribution is reduced by the probability integral transformation to 
a comparison of ‘observed’ points with a uniform (bstribution. It seems logical 
to use first the classical methods which are here very simple, as no constants have 
to be determined. For a uniform distribution 


p(2/)=l (O^ygl), 

the arithmetic mean and the median are, respectively, 

y = y = i- 


( 9 ) 


The mean error d and the probable error p, defined as half of the difference between 
the two quar tiles, are 

d = p = 1. (10) 

The fcth moment about the origin is 


which gives the recurrence relation 




k + V 




( 11 ) 


Since the distribution is symmetrical, the odd moments about the arithmetic 

mean vanish. Therefore „ _ 

P\ = 0. 

The even moments are 


= f = 2 r*z**=dz or = 

Jo Jo 


1 


2“(2i:+l)‘ 

Therefore the standard deviation, the coefficient of variation and the second 
beta are, respectively, 


"■“2^3’ 

(13) 


(14) 

A = f- 

(15) 


and 

It is necessary now to calculate the ‘observed’ means, the measures of dis- 
persion, and the relations between successive moments. To control the agree- 
ment between the theoretical uniform distribution and the ‘observed’ points 
we can still employ the standard error of the arithmetic mean. The general 
formula _ 
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becomes according to (13) 0 ^ ~ • 

The standard error of the dispersion for n large is cr(cr®) 
becomes, according to (12), 

7(4% (5 “9)) ^ & f { 5 ny 


( 16 ) 

fciil, ^hich 

a/ n 

(17) 


It seems reasonable to employ these old-fashioned tests before the use of 
more sophisticated methods is resorted to. Only if they fail would it be necessary 
to consider more elaborate methods. 

The n ‘observed’ points W(x,fj represent the probabilities, obtained from the 
hypothesis A,,, of the given observations in the ordinary sense, x„j. We plot these 
points in the interval 0, 1 which is divided into h cells of equal length, where h 
is chosen in such a way that njk is an integer. If n is a multiple of 10, we choose 
the cells (O-O.O-l). (0-1, 0-2), ..., (0-9, 1-0). 

The probability density of a point faUing somewhere within the interval 0, 1 
is constant. As the interval is of length 1, this density is 1. Therefore, the prob- 
ability of a point lying with a given cell is 1 jk, and the expected number of 
points in each cell is njk. The ‘observed’ number of points obtained through a 
hypothesis Hq will be (v = 1, 2, ..., fc). If we apply another hypothesis h-^ to the 
same observations or introduce other numerical values for the constants, the new 
set of ‘ observed ’ points will lead to values by which, in general, will differ from Uy . 

The classical statistical method of treatiag this material is the As the 

numbers o„, by will differ from the expected number njk, we can calculate for both 
hypotheses , ^ ,, 


The better hypothesis wiU have a lower value of and a greater value P, where 
P denotes the probability of obtaining the ‘observed’ deviations from uniform 
distribution or larger ones. The probability P depends upon the number of cells 
chosen. Therefore, to compare two competing hypotheses, the same division must 
be used. 

The apphcation of the y* test to the ‘observations’ eliminates an 

arbitrary action which is a serious and well-known drawback of the test, when 
applied to the original observations The expected contents of the classes 
depend upon the distribution. Therefore certain classes, as a rule the first and the 
last, must be chosen such that the expected number is not too small, otherwise 

becomes very large. In our case, no cell differs from any other and no arbitrary 
combination of cells is needed. We can choose k — n. The mean number of points 
in each cell will then be one and 



n 


)>=i 






( 18 ') 
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This choice removes another drawback of the method: different classifications 
■used for the same observations lead to different shapes of the distribution and 
therefore to different values of x*. Here the classification is prescribed once and 
for aU. 

Another comparison between theory and ‘observation’ may be based on the 
fact that different seta of points and have different probabilities. The prob- 
ability that Oj, points will fall within the cell v {= 1 , 2, . . . , it) is 

n\ /l\ai+aa+."+ai 

aJa^fTTr^! \i; ’ 

where ai^+a^+ ...+a^, — n. 

Since the factor n ! is constant, it is sufficient to investigate 

n=~. (19) 

n«v! 

Of course IT <P, as the latter probability applies to the ‘observed’ deviation or 
larger ones. The statement ‘The probability for points to be contained in the 
cell V is proportional to IT’ may be inverted according to Bayes’s principle. 
Therefore, IT is proportional to the probability that the distribution of points 
is rectangular, i.e. that is a good h 3 rpothesis. 

The question for which set a,,,the probability TT is maximum, is the starting- 
point of the classical relation between entropy and probability. Por large n the 
most probable set of points is the one which has the same number of points in 
each cell, i.e. 

Si = Sjj = ... = • (20) 


Let us call probability which corresponds to this distribution. The 

probability of the hypothesis will be greater, equal to, or less than the prob- 


ability of h^, if 


-Hq > IIj^ 
Hjnax. '' Hjaax. 


( 21 ) 


The relative probability of both hypotheses will be njir^ or Ilgini, depending 
on whether 


As these probabilities depend on the nxunber of cells, the same di-vision must he 
used to compare two competing hypotheses. We can choose 1c =n. Then JTj„ax. = 1 
and we can use Hq as test. 

The entropy test (21) is closely related to the (18). This classical 

relation can be obtained in the following simple way: if q is the constant prob- 
ability of a point falling within a given cell, then for n observations the expected 
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number of points within a cell is nq. But the ‘observed’ numbers will differ 
from the expected number by so that 


a„ = nq+e^, 

where S = 0- 

The quotient (21) becomes 

^0 AjNIL. 

When n is large, each factor becomes, by application of Stirling’s formula. 


nqi 


{nq + e, 

Expansion of the logarithm leads to 
nq\ 


Jl - (4)" «P [ - (»? -K-. + i) h (l + ^)] ■ 


(«g + e„)! 

According to the meaning of we ol^taui 

no 




n„ 




whence 




n 




max. 


( 22 ) 


Therefore, when n is large, the entropy test becomes identical with the test. 
This result was derived by Neyman & Pearson (1928), when they showed that 
the test followed from their ‘likelihood ratio’ method of approach. 

Neither test will give an answer, if the number of points a„ assigned by to 
the cell V is equal to the number of points assigned by to the cell A, = 6;^, 
where for any v{= 1, 2, it is possible to find a A (== 1, 2, ..., A), such that not 
all A = V. An example of this occurring is shown in Table 1, col. C and E. The 
reason for the failure is that we do not make use of the actual position of the 
observed points within the cells. We only ask in which cell they are situated. 
Although in such a case the teats do not show any difference between the arrange- 
ments and 6^, some conclusions might be drawn from such ‘observations’. 
If the number of points falling in the first few ceUa and also in the last few cells 
is disproportionately large, and if there is a deficiency in the middle cells (Table 1, 
col, D), we have to conclude that the distribution hg is too concentrated or that 
we have chosen too small a value for the constant which depends only on the 
standard deviation. If the number of points in the cells at either end is small 
(Table 1, col. B), the inverse inferences follow. These considerations may give a 
hint about the choice of an alternative hypothesis. , 
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To illustrate the above methods, let us take the fictitious example given by- 
Pearson (1938) in his Fig. 2, p. 136. He arranges w = 10 points in /fc = 10 cells 
and considers six sets A, B, ..., F, given in Table 1. 

Let us suppose that these six sets are the results of six different hypotheses 
apphed to the same observations. The test leads to 

Pj^> Pz,> Po = Pp> Pjs = Pe- 

The probabilities of the various columns give the same ordering 
^max. ~ ^ ^ PPc ~ P^F ^ P^B — P^E- 

The most probable set contains one point in each cell (set A). It is not possible 
to decide -whether 0 is more probable than F, and -whether B is more probable 
than E. 


Table 1. Pearson's set 


Class 

A 

B 

c 

D 

E 

F 

O'O-oa 

1 

2 

0 

2 

0 

0 

0 - 1 - 0'2 

1 

3 

0 

1 

0 

1 

0 ' 2 - 0-3 

1 

2 

0 

2 

1 

2 

0 - 3 - 0'4 

1 

1 

0 

0 

2 

2 

0 - 4 r - 0-5 

1 

0 

1 

0 

2 

0 

0 - 5 - 0-6 

1 

1 

2 

1 

3 

1 

0 - 0 - 0'7 

'1 

0 

1 

0 

1 

0 

0 ' 7 - 0-8 

1 

1 

2 

1 

1 

0 

0 - 8 - 0-9 

1 

0 

2 

1 

0 

2 

0 - 9 - 1-0 

1 

0 

2 

2 

0 

2 

y* 

0 

10 

8 

6 

10 

8 

p 

1 

0-350 

0-634 

0-740 

0-350 

0-634 

n 

1 


A 


- 1 . 

24 



The ■)^ and the entropy test are based upon the same data. But the results 
reached are incomplete, as artificialities are introduced by the classification of 
the ‘observations’ into the arbitrary cells. The actual position of the points 
within the cells is not used. The set A shows that these tests may be misleading 
in still another way. Each cell in set A contains exactly the expected number. 
But it would be false to conclude that the hypothesis is true, since the actual 
positions of the points within the cells might differ from the ideal positions. 

Let us suppose we know these positions. It might then happen that the 
difference between the observed and the ideal positions of the points is smaller 
for a set K than for a set L, even if the differences between the actual and the 
theoretical number of points is larger for K than for L. 

It is now our task to assign a meaning to the term ideal position and to define 
a measure of the differences between the ‘ observed ’ and the ideal set. 
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3. The toth point test 


The ideal position of n points, distributed -with nniform probability over the 
interval 0, 1, is such that the distances between consecutive pairs of points are 
equal. But there are a number of ways of distributing n points equidistantly 
over the interval 0, 1. E. S. Pearson, in the preceding note, suggests that 




2m — 1 
2n 


(23) 


might be used as the ideal position of the mth point. However, as y is a statistical 
variable, we should represent it by an average, to choose which we must consider 
the distribution of the mth point.* Any observation chosen at random has the 
same probability of falling on any position y within the interval. But for the mth 
point this probability depends upon y and m. The initial distribution w of the 
variable W{oc) = y is constant. According to (8) 

w(2/) = 1 (0<ygl). 

The probability of obtaining a point equal to or less than y is y. According to 
(2) the distribution pf the mth point is 


my^~\l - (24) 

The distribution (24) is of Karl Pearson’s Type I. For m == 1 (and m = n) the 
distribution will only decrease (increase). The distribution of the mth point is 
equal to the distribution of the (w - m + l)th point. If we replace y by 1 — y 

-y,n) = tojy, n). (24') 

The most probable position y of the mth point is given by 

n — m _ TO— 1 

1-2/ ~ y ’ 


which leads to 


ym — 


m— 1 
ra— 1 ‘ 


(25) 


For given values of m the median position can be obtained from the tables of 
the Incomplete Gamma Function. To find the arithmetic mean y and the control 
curves, it is necessary to have the moments iJ4 of (24). They are 

-y)«-’«dy. 

According to the well-known properties of the Gamma function 

wd (m-tfe- 1)1 (w-m)! _ nl (m -t- 1: — 1) ! 

(m-l)!(w-m)! (« + fc)! (w4-^)! (m— 1)1 

* [The distribution of a ranked individual sampled from a rectangular population, and the 
moments of this distribution, were obtained by Karl Pearson in the first of two papers {1931, 
p. 390, and 1932) dealing with ranked variates. Ed.] 
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Therefore 

For ib = 1 the arithmetic mean of the iwth point is 
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(26) 




m 


n+r 


and for ib = 2 the second moment is 




_ m + 1 


.Finally, the variance 


or. 


2 _ 


n + 2' 
m(n— m+ 1) 


(27) 


(28) 


(a + l)2 (m + 2)' 

These formulae apply also to the cases m = 1 and m — n. The standard deviation 
of the with value may he written 

Vm) 




71 + 2 


(28') 


This formula differs slightly from the general expression (3), and leads to an 
unexpected result; as we approach the centre from either side, the precision of 
the mth point decreases. The precision of the mth point will be a minimnTn for 
wi = |?i+ 1, if 71 is even, and for m = ^( 71 + 1) if n is odd. The values of (r„,y(7i + 2) 
are given in Table 2. 


Table 2. Standard deviation of the mth point 


Vm 

Vm 


0'06 

0-95 

' 0-21794 

010 

0-90 

0-30000 

015 

0-86 

0-35707 

0-20 

0-80 

0-40000 

0-26 

0-75 

0-43301 

0'30 

0-70 

0-46826 

0'35 

0-66 

0-47697 

0-40 

0-60 

0-49000 

0-46 

0-66 

0-49760 

0-50 

0-60 

0-60000 


We must now decide whether to use the mean (27) or the mode (26) as the 
ideal position of the mth point. The modes of the first and of the last points are 
0 and 1 respectively, whereas the corresponding means are 

As the ‘observations’ Wo(a:i) and Wo(a:„) of the first and the last point differ 
from 0 and 1, the arithmetic mean is to be preferred. 
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Formula (27) gives the ideal position and therefore the theoretical numbers 
of points in each cell which can be compared with the ‘observed’ numbers. This 
method leads to an improvement of the tests (18) and (21) where the choice of 
the cells was stiU arbitrary and where the actual position of each point was not 
taken into account. 

Besides comparing the uniform distribution with the ‘observed’ position of 
the points we can use the corresponding cumulative frequency. The probability 
scale y is plotted as abscissa and m as ordinate. We count the number of points 
below m. The mean position y of the mth point becomes a straight line differing 
from the diagonal which represents the modal positions. The figure opposite 
traces, for n ~ 20, the mean, the modal and Pearson’s position of the wth point 
given by (23). 

In the same way we plot the ‘observed’ points obtained by .... These 
probability points Wq(x^), Wy{x^ will be scattered about the straight line. Usuahy 
the area between the observed cumulative frequency curve, the ordinate and the 
parallel to the abscissa is kept equal to the corresponding area for the theoretical 
curve. Since for the present problem no constants have to be determined, we 
have no way of enforcing this equality. The area J bounded by the diagonal 
Straight line through the points with the co-ordinates to/(u.+ 1), to (w = 1, 2, ..,,»), 
the length !/(«.+ 1) to nj{n + l) of the abscissa axis and the two parallels to the 
oiclin.te axta, is J = ( 29 ) 

This might differ from the area of the n~l ‘observed’ trapezes 

(m = 1, 2, 1). 

As y^ are the ‘ observed ’ points 


»— i 

= S {m + i)iy,n+l~ym) 

m«=l 


i 


n—l j n 

- 2 

1 ^2 


1 

2 ^ 


= ny^-'^ym-^Uyn-yi)- 

1 


If we replace each value by its expectation from (27), we get 

(29') 

as it ought to be. The ‘ observed’ area is not equal to the theoretical area, but its 
expectation is. It might happen that the numerical value of JW is very close to J 
as a result of compensating deviations. Therefore this numerical comparison can 
be used as a test only in connexion with the graph of the ‘ observed ’ and theoretical 
cumulative histogram of the mth points. 
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To control the agreement between the ‘observed’ cumulative histogram and 
the ideal straight line we use two control curves through the points y^(T, m, 
where y is given by (27) and cr by (28')- They are traced in the figure for n = 20. 



Probability y— 

Fig. 1. Control curve for uniform distribution. 


Mean mth point y (27) Modal mth point J? (25) 

Pearson’s mtb point ^ (23) j — [ — | — ( Control curves 


It is interesting to calculate the area A bounded by these two curves and to 
compare it with the area J. If we consider m as abscissa and y + cr and y - cr as 
ordinates we have, for n sufficiently large, ' 

fn 

^4 = 1 [y+<r-{y-(T)]dm 
= 2 J (rdm. 

If we introduce y as variable of integration we obtain from (28') 


2) J i/(n+l> 
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The transformation 

y ~ sin’^ t, l-y = cost, dy ~ 2 sin t cos t dt, 
leads, as is well known, to 

cosiJij. 

TZ* “h 1 

The limits are given by 

For the expansion of (j it is sufficient to put 

.11 _ 1 1 

° arcsin^^^^ 6(» + l)^(» + l)’ 

provided (m + 1)^> 1. Under the same condition, 
arc sin f(\~ x^) = arc cos x 

= |rr — arcsina:, 

becomes for any | x* | < 1 

arosin^(l — a:®) == ^n — x + ^x^. 


Therefore 


so that 


t 

- o 


1 


1 


2 .,J(n + iy6.^{n+l)(n+l) 
1 1 


2 4 ^(w+l) 6(71 + 1 ) + 1) 

The second factor in the brackets becomes 

2 smi(,^(l-sin®to) (cos® ip — sin® to) == Bsintj V (1 “®ki®to) (1 — 2 sin®fo) 

In the same way 

(.-i) = 

Finally, we reach the area bounded by the control curves 
. n + 1 tn (n—l)^Jn 1 1 

~fin + 2)\i'^ (w + 1)® "V0^)’'’6(» + l)V(w + l)j 

According to (29) the ratio of the area bounded by the control curves to the area 
of the cumulative histogram 


))• 


A n.+ l 


J ra-lV(» + 2)\4'^ («.+ l)® V(®®+l)'^6(m+l)V(«+l) 

converges towards zero as 1 : 


{n-l).,jn 1 




( 30 ) 



E. J, Gumbbl 


331 


The prop,rti« (29') and (30) of the camnMve hietogtaM of the poeition. of 
the ».th points allow for the comparison of the 'observations ' WJx ) and the W„! 
pomts „ 1). It will often he sufBeient to inspect riie devS"^ 

observatiom, and theory to judge which « of 'observed' points is closer! 
on ‘he who e, h, the theoretical positions. It seems legitimate to prefer a hypo- 
thesis ho if the control area contains more points for hg than for A 

In order to secure a numerical lest, we can introduce the mean of the sum 
of the squares of the differences between the ‘observed’ values v = W(x ) anH 
the mean positions = m/(7i + 1). Take “ 


h ^ 

= - S 


/ m Y 
r”* 71 + 1 )' 


(31) 

where the value of the constant k will be specified later. One extreme for (31) 
would be to have the theory hold for every point. Then the value of the sum 
would be zero. The other extremes would be when all points are concentrated 
either at the origin, zero, or at the end, 1. In the first case 

1 


^ 2 2w + 1 


«(» + !)“ 1 ~ 6(n + l) ■ 

The second case leads to' the same value, since Em = Jii(n+ i) and therefore 

12 1 


-- i (l = 1- 

7l + lj 


1 + 


w(m + 1)2 


n 

2 m.2. 
1 


Therefore 




2?i+l k 
'6(71 +1)^3' 


(32) 


In order to draw conclusions from an observed value 8^ ^ve have to calculate its 
expectation 82 _ Wg will determine k in such a way that is independent of m. 

A test similar to (31), but serving another purpose, has been introduced for 
the. usual distributions by H. Cramer (1928) and R. von Mises (1931). When 
applied to uniform distributions, this test leads to the use of of (23) instead 
of the mean value y„,. For this test the sum of the deviations is zero which does 
not hold in our case. The expectation of @2 jg 

2my,„ 


@2 


- 1{ S ,[s5.- 






r.])- 


71+ 1 (n+l)2 

The first two sums are obtained from (27) and (26). Therefore 


©2 == 


/ Sm(7?i + 1) 

n \ 

Sm- 

\{7i+ l)(7l+2) 

(71+ 1)2/ 




m 


n 

1 


*4- 1 ) 4* 2 (h “j- 1 ) (w -f- 2)/ 
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The introduction of the sum of the powers of the natural numbers leads to 




/ 2n+l\ 

jV 3u+3/ 


2(?i + 2)\ 3«.+3/ 6(n+l)‘ 

Taking = 6(n + 1 ) we i)ropo8e therefore, as test of a hypothesis Aq, the coefficient 


@2 = 


6(n+ 1) 


n 


) ™ / mV 


(33) 


which, according to (32), can assume values between zero and 2 m + 1, and has for 


expectation the value 


S'*= 1. 


(34) 


Of two competing hypotheses, the one with the smaller value of & is to be 
preferred. 

The criterion does not introduce any arbitrary classification. It makes 
use of all observations. Besides the probabilities corresponding to the 

observed values no new calculations are needed. The test has a clear meaning 
and its application is simple. This is due to the fact that it is a natural consequence 
of the probability integral transformation. 


Summary 

We propose the following procedure for testing statistical hypotheses; The 
constants for competing hypotheses, having the same number of constants, are 
determined in such a way that their precisions are approximately the same. Then 
we calculate the probabilities lFo(a;,„), Wi(a:,„), ..., and their respective control 
curves. We trace If («„,), a:„ + cr„i and compare it with the observed frequency 
curve. If neither the classical tests nor the control curves indicate a clear superi- 
ority of one of the hypotheses we consider the probabilities as 'observations’ 
and plot the corresponding points on the y axis. We now compare, by formulae 
(9)— (17), the ‘observations’ with the theoretical uniform distribution and apply 
the and entropy teat of formulae (18) and (21), respectively. If necessary, we 
repeat these tests in such a way that the actual position of each point is taken into 
account. Formula (18') gives a value of which is independent of the classi- 
fication, Then we plot the cumulative frequency of the 'observations’, which is 
compared with the straight line (27) and controlled by the values given in Table 2. 
The final test is given by (33). 
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THE RANGE IN RANDOM SAMPLES 

By H. 0. HARTLEY 

1. iNTEODUOTIOlf 

If the observations (i' = 1, 2, of a random sample are arranged in 
ascending order of magnitude > *,) the range w in such samples is defined as 
the distance between the two extreme observations 

V) — X„—X^. 

It may therefore be regarded as a measure of the variability or dispersion among 
the observations of the sample. Theoretically its efficiency in the sense defined by 
R. A. Eisher is, as a rule, much inferior to that of the standard deviation. More- 
over, extensive investigations have shown that its random sampling distribution 
is markedly dependent on the parental population (E. S. Pearson, 1926). For 
large samples drawn from a parental distribution f(x) the extreme values 
and a:„ will lie right inside the lower and upper tail of f{x), and in practice it is 
only in exceptional cases that the exact shape o{f{x) has been established to such 
a degree of accuracy that the resulting distribution of w can be trusted for large n. 
In most cases the use of the range must therefore be limited to small samples, 
say with 2 < n < 20. 

Large numbers of small samples may often be used with advantage when the 
mean range is calculated as an estimate of the standard deviation of the popula- 
tion (Pearson & Haines, 1936). Although theoretically such an estimate is not 
efficient and certainly not sufficient, it is nevertheless of considerable importance 
in many fields of application because of its simplicity. Statistical control charts 
in industrial quality control make extensive use of it, and more recently the 
range has been applied to investigations in gunnery. 

In some fields of application a disadvantage may arise from the fact that the 
range is an inexact statistic; its random sampling distribution depends on the 
standard deviation of the parent. This applies in particular to the analysis of 
small samples in biological experiments. The tendency of modern small sample 
theory has been to replace such statistics by what are called exact statistics, 
obtained by substituting for the unknown standard deviation of the parent an 
estimate calculated from an independent sample. This particular process of 
reaching exact statistics has sometimes been referred to as ‘Studentization’. 
A general theory of this process will be given in a further paper which it is hoped 
to publish in this j ournal, where it will be shown how estimates of scale parameters 
in general, and of the range in particular, may be converted into exact statistics. 
In this paper, however, we deal with the case where the standard deviation of the 
parent is known. Indeed, it is this dependence of the random sampling distribu- 
tion of range on the scale parameter of the parent which makes it possible to 
estimate its efficiency as an estimate theoreof. 
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The question of grouping has been a subject of investigation in the case of 
the sample standard deviation; we shall here deal with the effect of grouping on 
the range, a problem which has so far received, we believe, no attention what- 
soever.* As practical examples of the occurrence of grouping we may quote 
three instances; 

{a) The rounding off of data for convenience of recording and analysis. 

(6) The recording of data to the nearest unit of measurement. Where the 
technique is of low accuracy (see e.g. Tildesley, 1940) the unit of measurement 
will be comparable in magnitude with the standard deviation of the actual data. 

(c) The analysis of data which are classified in categories. In such cases we 
may often find that the original data are unobtainable so that group frequencies 
are the only material available for an analysis. 

It will also be shown how the random sampling distribution of the range in 
grouped samples provides a suitable approach to that of the true range (ungrouped 
range in sample) on which extensive work has been done in earlier papers published 
in Biometrika. The mathematical formulae developed in this paper make this 
complex distribution amenable to a tabulation. For the case of normal samples, 
the work has actually been carried out and the resulting tables of the probability 
integral are given and discussed elsewhere in the present issue of this journal. 

2. The distribution of the range in a grouped sample 

Let us denote by aji, ..., the observations in a random sample drawn from 
the parental distribution /(a:)t and arranged in ascending order of magnitude. 
This sample is now classified in groups or categories of constant length h with 
equidistant end-points 

..., g-kh, ..., i-h, g, i + h, ..., g + fc/i, .... (1) 

covering the whole x scale from — oo to +co. Let us denote by and the 
respective centres of the categories containing and aii. Then the problem is to 
find the random sampling distribution of the range in a grouped sample, i.e. of 
The mean of this distribution is of particular interest. Obviously this 
statistic can only assume values which are multiples of the group interval h and 
is therefore discontinuous. Like the distribution of the ‘ungrouped’ or true range 
it depends on the standard deviation cr of the parent /(x). In addition, it depends 

* The elFeot of grouping seems to be of some importanoe in researches on the technique of 
anthropological measurement (Tildesley, 1940), where some of the results given below have already 
been applied before this paper had gone to press. 

t We shall deal here with a parental population represented by a ‘piecewise continuous distri- 
bution function f(x). A function is ca.lled ‘piecewise continuous’ for -co<x< +co if in any 
closed interval of x the function f(x) is continuous apart from a finite number of ordinary dis- 
continuities. If the actual range of the variate is bounded w'e simply define /(»)=0 outside this 
range. Moreover, we assume that /(x) has contact of at least second order at +co. It is easy to see 
how our results may be generalized to cover distribution functions with singularities. 



336 


The range in random samples 


i+vi\n 


on the category width h and on the position of the category midpoints relative to 
the population mean X. Of these parameters only h will in practice be known. 
Methods to eliminate cr are to be given in a separate paper whilst the elimination 
of X is dealt with in the section on randomized grouping (6). 

It wiU be convenient to use the following notation: 

M ri+jh . fi c® foo 

= f{x)dx, = f(x)dx, = f{x)dx. 

Ji J i+ih J —oa J —oo Ji J ^+ih 

Let us now find the chance that ~ 

is at most {m—l)k, and that in addition 

for a particular value of i. This chance is given by 

-(Lj • 

The first term in (2) represents the probability for all to lie between i + ih and 
^ + {i + m)h. Prom this we have to deduct the chance for all a;„ to lie between 
g + (i+ and g + (i + m) A. which is given by the second term of (2). In taking 
the difference we are therefore left with the chance for the occurrence of a sample 
completely contained in the interval ^+ik to (i + m) h but with at least one 
of the Xi lying between and +!)/». This proves that (2) represents 
the required chance. Now, since all samples may be classified with regard to 
their lowest category, the probability for fo be at most (m — 1) /i is given 

by summation over all i of the expression (2). If we denote this probability by 
P{n,h,m— 1,^) we find 

With equation (3) we have reached a formal representation of the random 
sampling distribution of — ffs evaluation is a simple matter for large group 
intervals h and for parental distributions/(a;) with a tabulated probability integral 

f{x) dx. If we were to take the trouble of tabulating the probability integral 


/■ 


(3) we should obtain the mean of as a by-product from a summation of (3). 

It will be shown in the next section that this summation, if carried out analytic- 
ally, produces a very simple formula for this mean. 

3. The meah kanub in a grouped sample 

To find the mean of the distribution it is convenient to extend the summation 
in (3) from some finite negative value i = — j up to -f- oo. By choosing j sufficiently 
large the resulting error may be made negligible. We introduce 


°° [“ / ri-l- w\ «■ / 

f=-iLvi / Vf + l/J 


( 4 ) 
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and find for the ditference between P{m- 1) and 

-j /•i+i r-^+i 

\P{m-l)-p_j(m-l)\^ 2 « =n\ , (5) 

J-oo 

for all m and j. To find the mean of we must first note the probability for 
this statistic to be exactly equal to mh, where m = 0, 1, 2, .... Denoting this 
probability by <j}{m) we have from the definition of P(m) 

4>{m) = P(m) — P(m - 1 ). 

If we denote the mean of by S we have by definition 


S == h <l>(k)k 

fc=0 


= h lim {(m +• 1) P(m) - 

m-> 00 

(m+l)<l>{0) + m<p{l) + ... + <^{m) 


( 6 ) 


m-> 00 

where 

or = P(0) + P(l) + ... + P(m). (7) 

To find 3 let us first consider the second term in formula (6). We have from 
equations (7) and (6) 


=P-^(0) + ---+P-^{m) + ei, 

I 1 

where I I ^ ’ 

for all j and m. Now from the definition of p_j(k) we find 


m m I r-/ i’i+k+l\n / /•£+fc+l\n” 

)-(L )] 

m ( / r-H^+l\n ■» f-/ i'i+i+2\n / ri+i+l\n-] 

=. 5 .((L )l?-X(L )-fL )] 

m / l'-i+/c+l\n to / j'i+m+2\n 

-,5.(L )■ 


Putting now m = 2j, we have 

2j j+l / ri \n j+l /rooVji 

Sp-#)= s + s I , 

where it is easily seen that 

leg|<2>f \ |e3l<2ymf + S 

J-oo J^ + 2 i=j + l 

Finally, we want to replace in formula (6) the first term 

(2j+l)P{2j) by (2j+l). 


+ fig + 63 


/•i+l 


( 8 ) 

( 9 ) 


( 10 ) 

( 11 ) 

( 12 ) 


* In this section we deal with fixed group intervals and a fixed sample size n so that we drop 
the arguments n, h and i. 
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The resulting error is easily estimated. We have from (5) 
(2j + 1) (1 -P(2j)) - {2j+ 1 ) + 


where 


Ie4K(2i+l)w 


J. 


1-1 


( 13 ) 


Moreover, according to the definition (4) we may write 

/ 00 / ;i / j'i + 2j + ‘i\7i 


/ f+j-in ji 


y i-2\ j! 

I -1 1 


M-1 


00 I'i ! 2j I- 2 

where I I < (2i+ 1) D h ^n{2j + l) 

i=~j Jii'ij'.l J 

SO that finally we have 

(2i+I)(l-?J_,-(2i)) = -e5 + eo. 


(*rj 

1 

jn 


with 




(f- +r). 


(14) 


(15) 


The error terms ej, eg, 63 , e,t, and are of the form 


C'J 


pra p-3 

; CJ 

' } J —p 


or S (i-c) 

i ■-‘j 


i + 1 


It is easy to see that the above terms tend to 0 as j-^co. To prove this for the 
first term we write 


(t^jh 
h 




\ Pco P CO 1 r 

^ 7 Ji ^ 7 , 


xf(x) dx, 


which tends to 0 as f{x) has contact of order 1 at + 00 . The proof for the other 
terms is identical. For sufficiently larger’ we can therefore use the approximations 
given by ( 10 ) and ( 12 ) and transform equation ( 6 ) into the convenient form 


3 = h\im (2;4- 1) 


j+l / ri \n ]-\-l / ('(a\n 

~a>l i=“3-l-l\ji / 


(16) 


Equation (16) gives the mean range 5 in a grouped sample in terms of powers of 
the probability integral of the parental population. .For a normal distribution 
this is a particularly simple formula since such powers have already been cal- 
culated by L, H. C. Tippett (1925) and are conveniently tabulated in Table XXI 
of the Tables for Statisticians and Biometricians, Part II. A table oiS can, there- 
fore, be easily computed by adding a few entries from Tippett’s table and 
deducting the sum from the appropriate value of 2j+l. 

This has been done for samples of five, ten and twenty observations grouped 
in categories of breadth h (see table on p, 339). The parameter ^ denotes the 
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distance of the population mean X = 0 from the nearest group end-point, For 
given h the mean range E in grouped samples is obviously a symmetrical periodic 
function of £ with period h. The table has been extended to cover rather coarse 
grouping intervals (7i = 2-2cr) in order to illustrate the possible bias of range when 
estimated from frequency tables with as few as two or three categories . It is apparent 
that for small or moderate group intervals, say h^cr, the mean range is practically 
independent of h and so that no correction (corresponding to the well-known 
Sheppard’s correction for the sample standard deviation) is required for the 

Table of mean range in grouped samples drawn from a normal 
popidation having unit standard deviation 

Size of sample =». Width of group interval = A. 

Distance of population mean to nearest group-end point = ^. 


h 

i 

n=6 

w = 10 

n.=20 

0-2 

0-0 

2-32 693 

3-07 761 

3'73 496 

0-0 

0-0 

2-32 593 

3 07 760 

3'73 500 


0'2 

2-32 693 

3'07 761 

3'73 492 

I'O 

0-0 

2-32 632 

3-08 122 

372 917 


0-2 

2-32 674 

3-07 866 

3-73 317 


04 

2-32 642 

3-07 450 

3-73 962 

14 

O'O 

2'31 042 

3-06 204 

3-82 069 


0-2 

2'31 626 

3-06 787 

3-78 826 


04 

2-32 938 

3-08 095 

3-71 676 


0'6 

2-33 990 

3-09 143 

3-66 796 

1-8 

0-0 

2-29 227 

2-90 639 

3-67 974 


0-2 

2-30022 

2-94 491 

3-69 665 


04 

2-32 023 

3'04 621 

3-73 086 


0-6 

2-34 276 

3-16 366 

3-76 263 


0-8 

2-36 734 

3-24 140 

3-77 838 

2’2 

0-0 

2-36 011 

2-77 080 

3-27 609 


0-2 

2-36 652 

2-81 669 

3-34 611 


04 

2-34 221 

2-94 307 

3-63 896 


0-6 

2-32 266 

3-11 681 

3-79 662 


0-8 

2-30 266 

3-28 173 

4-03 847 


1-0 

2-28 963 

3-38 368 

4-18 462 


range. For h = 0'2cr the mean range in the grouped sample agrees with the theo- 
retical ungrouped range to five places of decimals (see Table of Mean Range, 
Table XXII of Tables for Statisticians and Biometricians, Part II). For coarse 
grouping the correction becomes important but depends on i (as well as on h). 
For fixed h, as i varies from — to -f \h the grouped mean range oscillates about 
the true mean range as a smooth single-period function. The reason for this is 
obvious. If I has a position such that the average positions of and both 
happen to fall within the outside halves of two group intervals, then will 
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on the average, be smaller than vice versa, if the average positions of 

and ajj are in the inside halves of two group intervals, there will be a pre- 
dominance of samples for which is larger than a;„ — x^. 

Moreover, as h increases, the grouped mean range becomes a less reliable 
estimate, and it can be shown that for h>cr the standard deviation of the 
random sampling 'distribution of the grouped range rapidly increases with h. 

We are thus led to consider two problems; one is the elimination of the para- 
meter ^ (or the dependence of the distribution on the position of the parental 
population mean); the other is to investigate more closely the random sampling 
distribution of the grouped range. Before dealing with these problems, however, 
we must first consider the distribution of the true range (range in the ungrouped 
sample). 


4. The peob ability integral or the range in random samples 


As before we denote by x-^, ...,x^ the observations in a random sample drawn 
from a parental distribution f{x) and arranged in ascending order of magnitude. 
The range in such a sample, defined as w = — x-^, may be regarded as the limit 

of the grouped range as h, the group interval, tends to 0, i.e. 

w = 

A ->0 


The probability integral of the range w, denoted by is therefore the limit 

of P{n,h, m-1, i), given by (3), as h tends to 0. To obtain this limit we write 
equation (3) as follows: 


+ «) / /*4+({+in)fc \n. / r^+(i+7n)/( \n 

P(n,h,m-1,^) = S ( f(x)dx) -I f(x)dx) 

i=-<o\Jg+ih } / 


+ 00 
= S 


(j: 


f{x)dx] mn, 
ft / 


where is some mean value between g -|- ih and g -1- (i -f 1 ) A. 

We now put m = Wjh or W — mh, 

and let h tend to 0, m to oo, keeping W constant. We obtain without difficulty 
P„(TT) = lim P{n,li,m—1,^) 

7n-'>oo 


r+oo / \7i~i 

= «J_^/(^)y^ f{x)dxj d^, (17) 

which is the required probability integral of the range. This integral may be 
compared with the expression for the distribution function of w which was given 
by A. T, McKay & E. S. Pearson (1933). It is easily verified that the function 
(^(w) given by these authors is the differential of P{W). The expression for P{W) 
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is decidedly simpler than that for <j>(w) which was used by Pearson for numerical 
work on this function. However, even P{W) is of a complex character, and only 
in special cases is it possible to evaluate it analytically. For the rectangular 
distribution function (/(a:) = 1, 0 1) this can be done easily. 


6. The peobability integeal of the eahge nr samples 

FEOM A NORMAL POPULATION 

Of particular interest is the case where the parental population is normal. 


In this case we have 


f{x) = z(x) = 




-ia:2 


(18) 


L. H. C, Tippett (1926), E. S. Pearson (1926, 1932) and A. T. McKay & E. S. Pear- 
son (1933) have considered this problem and carried out extensive numerical work. 
The method adopted was to calculate correct values of the means and standard 
deviations of the distributions (as functions of n) and then , with the help of approxi- 
mate values of and use as approximations to the unknown true distribution 
Pearson-type curves fitted by the method of moments. The numerical results, 
although they have been successfully tested by experimental sampling, have, 
of course, an unknown accuracy. It is therefore desirable to find a method which 
produces P^i W) to known and sufficient accuracy. 

We have P,^( W) = nj z(|) y z{x) dxj dg 

r-ifr r+po 

= n -)-» + h (say)' 

J J -w 

Writing ’/ = -(!+ W), -^-rj+W, 

we obtain PJW) = n{ «(-'>?- IT) ( f z(x)dx| dTj + I^. 

J -iw \J -(v+m / 

Using the symmetry of z(x) we may write 

P„(lF) = ri.J 2(5/-l-lP)y z(x)dxj d'li + Ii, 

and writing ^ as a variable of integration in place of we find 

/•to / rl+ff' 

P„(W) = 7ij_^^[z(|-l-F)-pz(0]y^ 

/•do / re+fy \»-i 

or P„(Tf) = -Mj 

(•to / rs+fy \»-i 

2nj z(^-l-IT)y^ z(x)dxl d^. 




( 19 ) 
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Integrating the first integral and introducing « = g + If in the second integral 
we finally obtain 


Pum 


/ r + iw \n 1*00 / yi -1 

= ( z{:x)dx\ +2n\ z(m)I z{x)dx\ 
U-iH' / JiW \ju-n' f 


du. 


( 20 ) 


for large values of W this is an approximate solution of the problem since the 
second term in (20) is small, so that the first term 



gives a fair approximation to This expression denotes the chance of 

observing samples with observations all lying between —^W and +^W; all 
these samples have a range smaller than or equal to W. for large W it is these 
samples which constitute an ever-increasing proportion of the total number of 
samples with range < W. 

The second term in (20), which is always positive, takes into account all those 
samples which are not contained in the interval — Ilf to +^W. This term cannot 
be ignored if high accuracy is required and if IF is small or moderate. Never- 
theless, the work involved in the numerical integration has been considerably 
reduced, for the range of integration is now from +^W to +co. 

The numerical integration of 


'to / f'u \n— 1 

2 («)( z{x)dx) du 
J iW \J u-W 1 


is best carried out simultaneously for values of n forming an arithmetical pro- 
gression. for fixed u and IF, the integrand is then given by the terms of a geo- 
metrical progression with, say, 


as first term and 


z{u)i\ 2 (a:)da;) 
\J u-w / 

(P z(x)do^ 

\J u-w / 


2 


as common ratio, Such a geometrical progression can be produced automatically 
by certain modern calculating machines. This forms the basic idea of the actual 
computation of the probability integral which is described in detail in another 
paper (pp. 309-10 above). 


6. The bahoe in bahdomxy aEouPEn samples 

The results of sections (2) and (3) on the effect of grouping on the distribution 
of range depend on the parameter 4, which denotes the origin of the equi- 
distant set of group end-points 

i + ih i = 0, 1,2, ..., 

- 1 ,- 2 , ..., 

given by equation (1). In practice, however, all we know is the category breadth, A. 
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We then select the actual group end-points g + ih from considerations which 
are, by necessity, independent of the position of the mean X of the parental 
population f{x), because this position will generally be unknown. One of our 
group end-points, however, is bound to fall into the interval 

X — to X -j- \h, 

wherever this interval may lie. Now, since the origin | in our system of group 
end-points is wholly arbitrary we may assume that for given Ji we have the 
inequality X-^h^^^X + ^h. 

The fact that our group end-points (and therefore their origin g) are chosen in- 
dependently of X has now to be expressed in mathematical terms. This is done 
by assuming that we are dealing with a population of values of ^ (being the origins 
of corresponding systems of group end-points) which are rectangularly distributed 
in the interval X— lh^^^X + ^h. This condition is exactly fulfilled where 
grouping has been introduced through rounding off of data (example ( 0 ) on 
p. 336), and it is often an appropriate assumption in the other examples, as in 
many otlier cases of grouping which occur in practice.* 

In order to derive the distribution of range in samples randomly grouped in 
the above sense we have to return to section (2). In this section we derived the 
probability P{n,h,m — 1,^), giving the chance that the difference between the 
centre points highest and lowest category covered by a sample 

of n items is at most {m—l)h, where h is the constant category breadth and 
group end-points are given by (1). The frequency distribution of which 
we may denote by ^^(w, h, m, |) is then given by 

h, m, i) = Pin, h, m, g) -P{n,h,m-1, i), 

and represents the chance that is exactly mh, given a particular value of |. 

The corresponding frequency distribution for random grouping may be denoted 
by (p{n, li, m). To derive it we may apply Bayes’s Theorem and obtain 

1 rxfift 
nj X-ih 

The resulting cumulative jjrobability may therefore be defined by 

m 1 ^X^-4A m 

Pin, h,m) = S ^in, /», j ) = t S ^,3, i) 

which yields the corresponding relation for the cumulative probabilities 

1 rx\-ih 

Pin,h,m) = Pin,h,m,^)d^. 

njx-ih 

* In certain coses, when grouping is very coarse, it may be advantageous to use an estimate 
3 of the population mean X (either dependent or independent of the sample whose range is con- 
sidered). The increase in information is akin to that given by an ancillary statistic in estimation 
theory. However, from the results in § 3 it would appear that little information is gained where 
the grouping interval is small or moderate. 
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Substituting, now, the expression (3) for P(7i, h, m, we obtain 
1 +00 rX+iA / / «+(£+m l-n/i 


P{n,h,m) = j S 

A£=_io J A'-!A [\Ji 

4i: 


S+iA 

M-(m-l-l)A 


f{x)dx\ 


\n / j'S+H+m+l)h \n\ 

’) - f{x)dx\ 

I \Js+(i + l)7i / J 


f{x)dxj ^ fix)dxj 


d^ 


(21) 


This formula may be reduced to a simpler form in which its relation to the pro- 
bability integral of the true range (17) becomes apparent. We introduce the 
second integral 

nv 

=J^ P,{w)dw 

I'W r + oo / fi+w \w-X 

= Jo wj f(x)dxj d^dw. (22) 

The first integration is with regard to w and the integrand is an integral with 
regard to If, now, in this latter integral — ^ + wiB used as variable of integra- 
tion in place of | we have 

™ f{^)dx\ dTjdw. 

J 0 J -00 \J i)-«) / 

We now note that the integrand may be written as a differential with regard to w. 
Thus, interchanging the order of integration we obtain 


r + co rw d, { 1 ™ 

r+oo f rv 

= f(x)dx\ dTj, 

J-oo [Jv-W J 


(23) 


thus eliminating integration with regard to w. The surviving variable of integra- 
tion ij may now be replaced by ^ — TF so that we reach the final result 


(•+«./ n+w 


f{x)dx\ di. 


(24) 


We now observe that this integral is identical with the one occurring in the 
expression (21) for P(n, h, m), and we note the relation 


Pin, 'h,m) = {Tn{m + lh) — Pni‘‘^h)} 


1 ('(m+Dli 

= T Pn{w)dw. (26) 

“J m/i 

This simple formula makes it possible to obtain the effect of random grouping on 
the probability integral of range P„( W). In particular, for normal samples for 
which P„(Tr) has been tabulated at the fine interval oi Aw — 0'05, the second 
integral P„( Tf ) is easily obtained numerically by summation of the tabular entries 
in the table on pp. 302-7 above, 
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For h->Q, m->oo, mh-^W, we note that, as expected, ?(%,/;., m)r:P„(lf); 
that is, as grouping becomes finer and finer the probability integral of the grouped 
range tends to that of the true range. What is not quite obvious, however, is the 
identity of mean grouped range and mean true range, no matter how large the 
breadth, h, of the randomly placed groups. This point we shall now examine. 

If we denote by w{n, h) the mean range in samples of n items classified in 
groups of breadth h randomly placed, we have by definition 
00 00 

w{n,h) = <f){n,h,m)mh= Yj {P{n,h,m)-P{n,'k,m-\)}’mh. (26) 

We now introduce central differences of the function FJW) and use the notation 
^m + i = Pni'>^+ lh)~Fj^(mh), d", = \h) - 2F^{mh) + F^{m -Ih), (27) 

so that we obtain from (26), (26) and (27) 


We may now write 


w{n,Ji) - 

7/1 = 1 

M 

w{n,h) = lim Y 

ni~l 


= lim jiWdW+i- 

W— ►co^ / 

= (28) 
iU->oo 


On the other hand, we have for the mean true range (w^ say) 

wf^{w}dw, 

d 

where f„iw) = -^P^(w) is the distribution function of the true range. We, 
therefore, have 


_ 

Wn = lim wfjw) dw 
Af->ooJ 0 


= lim \MhP^ 

M-*<d 


*Mh 

\{Mh)- Pn{v>)dw . 
Jo 


w{n,h}-w„=^ lim MhlTA'M+^-PJMh) 


On taking the difference of (28) and (29), we see that 


Now 

where 
80 that 


1 _ 1 /’(J'-f+W 


P^(w) dw = P^{Mh) +/„ {w*) h*, 


Mh<w* <(M +\)h and h*^h, 
w(n,h)~Wn\^ lim fJw*)Mh*. 

ilf— voo 


(29) 
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From this relation it is obvious that w{n, h) = w^, since /„(w) has contact of at 
least second order as t«->oo.* 

We have proved, therefore, that if grouinng is random the expectationf of 
the mean range w{n, h) (mean groixped range) is identical with the true mean range 
so that no bias is introduced through random grouping, no matter how large 
the grouping interval h. However, if we wish to use as an estimate of 

this estimate, although unbiased in the sense defined, becomes less and less 
reliable as h increases. This is borne out by its random sampling distribution or 
its probability integral P(n, h, m) given by (25). For normal samples it is an easy 
matter to tabulate P(n,h,m) from the table of P„(W) (pp. 302-7) and thus to 
follow up the numerical increase of its standard deviation as h increases. How- 
ever, to cover the case of a general parental distribution f{x), we shall derive an 
analytical formula for the variance of — ix from which approximate numerical 
results are easily obtained. 

In order to obtain this formula we consider the second moment of — ^x 
which we may denote by h). We have by definition and from equations (25), 
(26) and (27) 

M 1 

/i 2 (n,h) = lim 

M 

= lim S (30) 

Now we may write 

M M M 

m“(l 7n=0 m=0 

M-1 M-1 

= + S {'in + ^)A',n.x.^ — MF^^(Mh)+ 2 

m=0 m=o 

M-1 

= M{M + l)A'j^+i-{m~i)FJMh) + 2 S Fnimh). (31) 

w=0 

Using equation (31) we obtain for the second moment (30) 
h) ^ Jim^ \{Mnf ^ 1 +-^j - 2MhF,,{Mh) 1 1 

(32) 

This formula enables us to compare /ta(n, h) with /^a(w), the second moment of 
the distribution of the true range. We have by definition 

rMh 

/ialix) = lim fjw)w^dw, 

i1f->coJ 0 

* It can be proved that the order of oontact of/„(w) is the same as that of the parental dis- 
tribution f(x). 

t If repeated samples were drawn from the same population and the same grouping system 
used in each case, the mean grouped range would be biased by an unknown amount. But in repeated 
experience with different populations the expectation of this bias is zero. 
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wliioh ^ve transform by two partial integrations into the equation 
H4)i) = lim \[Mh)^P„{Mh)-2{Mh)F„{Mh) + 2 

Jg 

Talcing the difference of (32) and (33) we obtain 




= II, „ 


i»r->cc \ 2 


"1 


-ir-i 


+ lim 2/i S F,Xmh) + hF„{Mh)-2 
.1/-J-CC I >n—u , 


m 


F„(w) dw 


(33) 


(34) 


Since the expected mean value of the grouped range is the same as for the 
true range, the difference in second moments about zero equals the difference in 
variances. The first term on the right-hand side of equation (34) is obviously 0 
(see equation (29)), whilst the second term is best evaluated with the help 
of Gregory’s fornuda for numerical integration (see e.g. L. J. Comrie, 1936, 
p. 809). Using this formula we can expre.ss the difference between the integral 
and the finite .sum in equation (34) in term,s of the differences of the integrand 
F^^{w) at the tw'o ends of the range of integration. We obtain 




Mn,h)-J(i{n) = lim 

M~>yj ^ IZ. 


.u-r 




ih{h-A\) + A,^hAl+.,., 


(35) 


provided the Gregory expansion is convergent.* A\, zl^', zlj", ... are advancing 
differences of the function F^{w) at w = 0. 

Equation (3.'5) yields the desired formula for the second moment of 
For most parental distributions the resulting probability integral of the range 
will be practically 0 for a certain range in the neighbourhood of the origin (see for 
instance the behaviour of P^( W) from a normal parent given in the table on 
pp. 302-7). For such parents and for moderate h we have 

^... ^0, 

so that Fiin, h) — /(^(n) s (36) 

For small or moderate values of h, therefore, the increase in variance of the 
grouped range is given airproximately (and for most parents to a high degree of 
accuracy) by This iircrease is double the amount given by the well-known 
Sheppards correction of Indeed, had we grouped a sample of true ranges w 
in fixed categories of breadth h, the resulting second moment of the grouped 
distribution of w would have an expectation which is ■^h'^ in excess of the second 
moment of the true range. With random grouping of the original sample (as it 

* This condition is as a rule fulfilled for values of h which do not exceed the standard deviation 
of the parental distribution f{x). 



348 Tk range k random samples 

has been defined above in accordance with common practice of grouping) an 
additional uncertainty is introduced by using a new, randomly selected, set of 
group intervals each time a new grouped range is determined, This additional 
uncertainty has been proved roughly to double the excess of the variance and the 
result is an increase of over the variance of the true range. 

I wish to acknowledge with gratitude the helpful suggestions and criticisms 
made by Drs J, Wishart and J. 0. Irwin and by Professor E. S. Pearson at 
various stages of this investigation. 
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(i) The Second Yearbook of Research and Statistical Methodology Books 
and Reviews. Edited by Oscab Kbiseu Bttbos. The Gryphon Press, 
Highland Park, New Jersey, 1941. $5. 

This is a second and much enlarged issue of a volume published in 1938. It contains 
nearly seventeen hundred review excerpts on 346 statistical and allied books (in the 
English language only), extracted from 283 different journals. The editor has attempted 
with considerable success to cover the whole field of statistical and probability theory, as 
well as their applications in every possible direction. He has also included reviews of a 
number of books on the general history of science, on scientific method and on the social 
relations of science on the ground that they are — or should be — of general interest to 
scientific workers in every special field. Included in this category are books such as 
J. D. Bernal’s The Social Function of Science, J. G. Crowther’s The Social Relations of 
Science and J. B. S. Haldane’s The Marxial Philosophy and the Sciences. 

This large volume of several hundred pages has been admirably produced and arranged. 
It is intended to publish a fresh volmne every two years containing reviews that have 
appeared in the interval. The Preface sets out a variety of reasons which, in the Editor’s 
opinion, justify the present venture and oven its enlargement in the future if sufficient 
support is forthcoming ; at the same time frank expressions of opinions are asked for from 
readers and reviewers. 

The objectives of the Yearbook as set out may be classed under four general heads: 

(1) To help students, teachers and librarians to select text-books with greater dis- 
crimination and to point out to them the weak and strong points of particular books. 

(2) To indicate the width of the .subject of statistics and the many fields in which it 
is applied. 

(3) To make students and teachers aware of the inadequacy of much that is now pre- 
sented in text books and elassea ; to discourage the publication of books written by persons 
ignorant of the latest developments in their subject. 

(4) To improve the quality of reviews by encouraging editors and reviewers alike to 
take their responsibilities more seriously. 

With the last three objectives it is hardly possible to quarrel, and it is likely that the 
wide circulation of this volume would provide one of the most direct methods of attaining 
these ends. The first objective is, however, presumably the most important, and there are 
bound to be differences of opinion oii the probable success of the book in this direction. In 
the ordinary event the teacher will no doubt be made aware of new books in the field with 
which he is concerned by reading the notices in one or two journals specially devoted to his 
subjeot. Having obtained a suggestion of a likely book he must surely get hold of it and 
determine by reading it himself whether it is suitable and abreast of the latest develop- 
ments. If he is not competent to do this, but must base his decision on the advice of 6-10 
reviewers, it seems doubtful whether he should be teaching the subjeot at all. 

After reading through the reviews on some dozen books contained in the present 
Yearbook, 1 am inclined to the following conclusions, Kegarding books of outstanding but 
perhaps rather controversial character, as those of Harold Jeffreys and Richard von Mises, 
the reader will certainly gain a useful impression from the eolleoted reviews. This is partly ■ 
because in such cases the standing of the reviewers is high and their reviews interesting and- ; 
fairly written, even if critical. But in the cose of the more elementary text book,: th^ 
position is rather different. Quite often the opinions expressed are diametrically oppo^*®. 

In cases where I knew nothing of the book or its author I found myself inevitably forced 
to form an opinion from my own personal knowledge of the experience, the special interests 
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ftnd oven the character of the reviewer. Such inside information will generally not be 
possessed l)y the- College instructor and certainly not by the student. We cannot, I think, 
escape the conclusion that the teacher who has to select a text book for his students must 
be competent to decide on its merits himself and, if he is not, he may be only confused by 
the varied opinions contained in the Yearbook, 

Three future directions- in which the volume might be enlarged are contemplated: 

(1) The inclusion of reviews of foreign language (i.e, not English) books, 

(2) The addition of a section devoted to non-critical abstracts of periodical literature 
on research and statistical methods. 

(3) The publication of original criticisms by one or more persons (according to the 
importance and controversial nature ) of articles and papers in the periodical literature. 

The first addition is clearly desirable; the .publication of translations of reviews in 
foreign journals of our own American and British books would probably be useful too ; it 
would help us to see ourselves as others see us. With regard to the second and third pro- 
posals, the great difficulty is of course to secure the services of sufficiently competent 
abstractors or critics for so large an undertaking. If, quoting the Editor, the statistical 
student and teacher are to be kept ‘abreast of modem developments in statistical theory’ ; 
to be warned ‘to ignore much of the literature which either presents nothing new or presents 
inefficient or incorrect methods of statistical analysis’; to be told what are ‘sloppy, value- 
less, and erroneous articles’ and what are ‘well --written, significant contributions’, it is 
clear that a very great responsibility -will lie on the Editor of the Yearbook and his col- 
laborators. As Prof, Buros indicates, the organizing and editing of such a comprehensive 
service would need the support of a foundation Interested in fostering the advancement of 
research. Indeed, to avoid duplication the organization must be built up on an inter- 
national basis, possibly in collaboration with such bodies as the American Statistical Asso- 
ciation and the Royal Statistical Society between whose representatives some discussion 
of a similar project took place a few years ago. E S. P 
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